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COMPOUNDS FOR IMMUNOTHERAPY AND DIAGNOSIS 
OF BREAST CANCER AND METHODS FOR THEIR USE 

TECHNICAL FIELD 

The present invention relates generally to compositions and methods for the 
treatment and diagnosis of breast cancer. The invention is more particularly related to 
polypeptides comprising at least a portion of a protein that is preferentially expressed in 
breast tumor tissue and to polynucleotide molecules encoding such polypeptides. Such 
polypeptides may be used in vaccines and pharmaceutical compositions for treatment of 
breast cancer. Additionally such polypeptides and polynucleotides may be used in the 
immunodiagnosis of breast cancer. 

BACKGROUND OF THE INVENTION 

Breast cancer is a significant health problem for women in the United States 
and throughout the world. Although advances have been made in detection and treatment of 
the disease, breast cancer remains the second leading cause of cancer-related deaths in 
women, affecting more than 1 80,000 women in the United States each year. For women in 
North America, the life-time odds of getting breast cancer are now one in eight. 

No vaccine or other universally successful method for the prevention or 
treatment of breast cancer is currently available. Management of the disease currently relies 
on a combination of early diagnosis (through routine breast screening procedures) and 
aggressive treatment, which may include one or more of a variety of treatments such as 
surgery, radiotherapy, chemotherapy and hormone therapy. The course of treatment for a 
particular breast cancer is often selected based on a variety of prognostic parameters, 
including an analysis of specific tumor markers. See^ e.g., Porter-Jordan and Lippman, 
Breast Cancer 5:73-100 (1994). However, the use of established markers often leads to a 
result that is difficult to interpret, and the high mortality observed in breast cancer patients 
indicates that improvements are needed in the treatment, diagnosis and prevention of the 
disease. 



BNSDOCiD: <WO 9933869A2_I_> 



wo 99/33869 



PCT/US98/27416 



2 

Accordingly, there is a need in the art for improved methods for therapy and 
diagnosis of breast cancer. The present invention fulfills these needs and further provides 
other related advantages. 

SUMMARY OF THE INVENTION 

The present invention provides compounds and methods for immunotherapy 
of breast cancer. In one aspect, isolated polypeptides are provided comprising at least an 
immunogenic portion of a breast tumor protein or a variant of said protein that differs only in 
conservative substitutions and/or modifications, wherein the breast tumor protein comprises 
an amino acid sequence encoded by a polynucleotide molecule having a partial sequence 
selected from the group consisting of (a) nucleotide sequences recited in SEQ ID NOS: 3, 10, 
17, 24, 45-52 and 55-67, 72, 73, and 89-94, (b) complements of said nucleotide sequences 
and (c) sequences that hybridize to a sequence of (a) or (b) under moderately stringent 
conditions. 

In related aspects, isolated polynucleotide molecules encoding the above 
polypeptides are provided. In specific embodiments, such polynucleotide molecules have 
partial sequences provided in SEQ ID NOS: 3, 10, 17, 24, 45-52 and 55-67, 72, 73, and 89- 
94. The present invention farther provides expression vectors comprising the above 
polynucleotide molecules and host cells transformed or transfected with such expression 
vectors. In preferred embodiments, the host cells are selected from the group consisting of E. 
coliy yeast and mammalian cells. 

In another aspect, the present invention provides fusion proteins comprising a 
first and a second inventive polypeptide or, alternatively, an inventive polypeptide and a 
known breast antigen. 

The present invention also provides pharmaceutical compositions comprising 
at least one of the above polypeptides, or a polynucleotide molecule encoding such a 
polypeptide, and a physiologically acceptable carrier, together with vaccines comprising at 
least one or more such polypeptide or polynucleotide molecule in combination with a non- 
specific immune response enhancer. Pharmaceutical compositions and vaccines comprising 
one or more of the above fusion proteins are also provided. 
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In related aspects, pharmaceutical compositions for the treatment of breast 
cancer comprising at least one polypeptides and a physiologically acceptable carrier are 
provided, wherein the polypeptide comprises an immunogenic portion of a breast tumor 
protein or a variant thereof, the breast tumor protein being encoded by a polynucleotide 
molecule having a partial sequence selected from the group consisting of: (a) nucleotide 
sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88, 
(b) complements of said nucleotide sequences, and (c) sequences that hybridize to a sequence 
of (a) or (b) under moderately stringent conditions. The invention also provides vaccines for 
the treatment of breast cancer comprising such polypeptides in combination with a non- 
specific immune response enhancer, together with pharmaceutical compositions and vaccines 
comprising at least one polynucleotide molecule having a partial sequence provided in SEQ 
ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88. 

In yet another aspect, methods are provided for inhibiting the development of 
breast cancer in a patient, comprising administering an effective amount of at least one of the 
above pharmaceutical compositions and/or vaccines. 

The present invention also provides methods for immunodiagnosis of breast 
cancer, together with kits for use in such methods. In one specific aspect of the present 
invention, methods are provided for detecting breast cancer in a patient, comprising: (a) 
contacting a biological sample obtained from a patient with a binding agent that is capable of 
binding to one of the inventive polypeptides; and (b) detecting in the sample a protein or 
polypeptide that binds to the binding agent. In preferred embodiments, the binding agent is 
an antibody, most preferably a monoclonal antibody. 

In related aspects, methods are provided for monitoring the progression of 
breast cancer in a patient, comprising: (a) contacting a biological sample obtained from a 
patient with a binding agent that is capable of binding to one of the above polypeptides; (b) 
determining in the sample an amount of a protein or polypeptide that binds to the binding 
agent; (c) repeating steps (a) and (b); and comparing the amounts of polypeptide detected in 
steps (b) and (c). 

Within related aspects, the present invention provides antibodies, preferably 
monoclonal antibodies, that bind to the inventive polypeptides, as well as diagnostic kits 
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comprising such antibodies, and methods of using such antibodies to inhibit the development 
of breast cancer. 

The present invention further provides methods for detecting breast cancer 
comprising: (a) obtaining a biological sample from a patient; (b) contacting the sample with 
a first and a second oligonucleotide primer in a polymerase chain reaction, at least one of the 
oligonucleotide primers being specific for a DNA molecule that encodes one of the above 
polypeptides; and (c) detecting in the sample a DNA sequence that amplifies in the presence 
of the first and second oligonucleotide primers. In a preferred embodiment, at least one of the 
oligonucleotide primers comprises at least about 10 contiguous nucleotides of a DNA 
molecule having a partial sequence selected from the group consisting of SEQ ID NOS: 1-94. 

In a further aspect, the present invention provides a method for detecting 
breast cancer in a patient comprising: (a) obtaining a biological sample from the patient; (b) 
contacting the sample w^ith an oligonucleotide probe specific for a polynucleotide molecule 
that encodes one of the above polypeptides; and (c) detecting in the sample a polynucleotide 
sequence that hybridizes to the oligonucleotide probe. Preferably, the oligonucleotide probe 
comprises at least about 15 contiguous nucleotides of a DNA molecule having a partial 
sequence selected from the group consisting of SEQ ID NOS: 1-94. 

In related aspects, diagnostic kits comprising the above oligonucleotide probes 
or primers are provided. 

These and other aspects of the present invention will become apparent upon 
reference to the following detailed description. All references disclosed herein are hereby 
incorporated by reference in their entirety as if each was incorporated individually. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1 A and B show the specific lytic activity of a first and a second B51 IS-specific 
CTL clone, respectively, measured on autologous LCL transduced with B511s (filled 
squares) or HLA-A3 (open squares). 
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DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is generally directed to compositions 
and methods for the immunotherapy and diagnosis of breast cancer. The inventive 
compositions are generally isolated polypeptides that comprise at least a portion of a breast 
tumor protein. Also included within the present invention are molecules (such as an antibody 
or fragment thereof) that bind to the inventive polypeptides. Such molecules are referred to 
herein as "binding agents." 

In particular, the subject invention discloses polypeptides comprising at least a 
portion of a human breast tumor protein, or a variant thereof, wherein the breast tumor protein 
includes an amino acid sequence encoded by a polynucleotide molecule including a 
sequence selected from the group consisting of: nucleotide sequences recited in SEQ ID 
NOS: 1- 94, the complements of said nucleotide sequences, and variants thereof. As used 
herein, the term "polypeptide" encompasses amino acid chains of any length, including full 
length proteins, wherein the amino acid residues are linked by covalent peptide bonds. Thus, 
a polypeptide comprising a portion of one of the above breast proteins may consist entirely of 
the portion, or the portion may be present within a larger polypeptide that contains additional 
sequences. The additional sequences may be derived from the native protein or may be 
heterologous, and such sequences may be immunoreactive and/or antigenic. 

As used herein, an "immunogenic portion" of a human breast tumor protein is 
a portion that is capable of eliciting an immune response in a patient inflicted with breast 
cancer and as such binds to antibodies present within sera from a breast cancer patient. Such 
immunogenic portions generally comprise at least about 5 amino acid residues, more 
preferably at least about 10, and most preferably at least about 20 amino acid residues. 
Immunogenic portions of the proteins described herein may be identified in antibody binding 
assays. Such assays may generally be performed using any of a variety of means known to 
those of ordinary skill in the art, as described, for example, in Harlow and Lane, Antibodies: 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1988. For 
example, a polypeptide may be iirmiobilized on a solid support (as described below) and 
contacted with patient sera to allow binding of antibodies within the sera to the immobilized 
polypeptide. Unbound sera may then be removed and boimd antibodies detected using, for 
example, *"l-labeled Protein A. Alternatively, a polypeptide may be used to generate 
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monoclonal and polyclonal antibodies for use in detection of the polypeptide in blood or 
other fluids of breast cancer patients. Methods for preparing and identifying immunogenic 
portions of antigens of known sequence are well known in the art and include those 
summarized in Paul, Fundamental Immunology, ed.. Raven Press, 1 993, pp. 243-247. 

The term "polynuc]eotide(s) » as used herein, means a single or double- 
stranded polymer of deoxyribonucleotide or ribonucleotide bases and includes DNA and 
corresponding RNA molecules, including HnRNA and mRNA molecules, both sense and 
anti-sense strands, and comprehends cDNA, genomic DNA and recombinant DNA, as well as 
wholly or partially synthesized polynucleotides. An HnRNA molecule contains imrons and 
corresponds to a DNA molecule in a generally one-to-one manner. An mRNA molecule 
corresponds to an HnRNA and DNA molecule from which the introns have been excised. A 
polynucleotide may consist of an entire gene, or any portion thereof Operable anti-sense 
polynucleotides may comprise a fragment of the corresponding polynucleotide, and the 
definition of "polynucleotide" therefore includes all such operable anti-sense fragments. 

The compositions and methods of the present invention also encompass 
variants of the above polypeptides and polynucleotides. A polypeptide "variant," as used 
herein, is a polypeptide that differs from the recited polypeptide only in conservative 
substitutions and/or modifications, such that the therapeutic, antigenic and/or immunogenic 
properties of the polypeptide are retained. In a preferred embodiment, variant polypeptides 
differ from an identified sequence by substitution, deletion or addition of five amino acids or 
fewer. Such variants may generally be identified by modifying one of the above polypeptide 
sequences, and evaluating the antigenic properties of the modified polypeptide using, for 
example, the representative procedures described herein. Polypeptide variants preferably 
exhibit at least about 70%, more preferably at least about 90% and most preferably at least 
about 95% identity (determined as described below) to the identified polypeptides. 

As used herein, a "conservative substitution" is one in which an amino acid is 
substituted for another amino acid that has similar properties, such that one skilled in the art 
of peptide chemistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. In general, the following groups of amino acids 
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represent conservative changes: (1) ala, pro, gly, glu, asp, gin, asn, ser, thr; (2) cys, ser, tyr, 
thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his. 

Variants may also, or alternatively, contain other modifications, including the 
deletion or addition of amino acids that have minimal influence on the antigenic properties, 
secondary structure and hydropathic nature of the polypeptide. For example, a polypeptide 
may be conjugated to a signal (or leader) sequence at the N-terminal end of the protein which 
co-translationally or post-translationally directs transfer of the protein. The polypeptide may 
also be conjugated to a linker or other sequence for ease of synthesis, purification or 
identification of the polypeptide {e.g., poly-His), or to enhance binding of the polypeptide to a 
solid support. For example, a polypeptide may be conjugated to an immunoglobulin Fc 
region. 

A nucleotide "variant" is a sequence that differs from the recited nucleotide 
sequence in having one or more nucleotide deletions, substitutions or additions. Such 
modifications may be readily introduced using standard mutagenesis techniques, such as 
oligonucleotide-directed site-specific mutagenesis as taught, for example, by Adelman et al. 
(DNA, 2:183, 1983). Nucleotide variants may be naturally occurring allelic variants, or non- 
naturally occurring variants. Variant nucleotide sequences preferably exhibit at least about 
70%, more preferably at least about 80% and most preferably at least about 90% identity 
(determined as described below) to the recited sequence. 

The antigens provided by the present invention include variants that are 
encoded by DNA sequences which are substantially homologous to one or more of the DNA 
sequences specifically recited herein. "Substantial homology," as used herein, refers to DNA 
sequences that are capable of hybridizing under moderately stringent conditions. Suitable 
moderately stringent conditions include prewashing in a solution of 5X SSC, 0.5% SDS, 
1.0 mM EDTA (pH 8.0); hybridizing at 50°C-65°C, 5X SSC, ovemight or, in the event of 
cross-species homology, at 45°C with 0.5X SSC; followed by washing twice at 65°C for 20 
minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% SDS. Such hybridizing DNA 
sequences are also within the scope of this invention, as are nucleotide sequences that, due to 
code degeneracy, encode an immunogenic polypeptide that is encoded by a hybridizing DNA 
sequence. 
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Two nucleotide or polypeptide sequences are said to be "identical" if the 
sequence of nucleotides or amino acid residues in the two sequences is the same when aligned 
for maximum correspondence as described below. Comparisons between two sequences are 
typically performed by comparing the sequences over a comparison window to identify and 
compare local regions of sequence similarity. A "comparison window" as used herein, refers 
to a segment of at least about 20 contiguous positions, usually 30 to about 75, more 
preferably 40 to about 50, in which a sequence may be compared to a reference sequence of 
the same number of contiguous positions after the two sequences are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using the 
Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc., 
Madison, WI), using default parameters. This program embodies several alignment schemes 
described in the following references: Dayhoff, M.O. (1978) A model of evolutionary change 
in proteins - Matrices for detecting distant relationships. In Dayhoff, M.O (ed.) Atlas of 
Protein Sequence and Structure, National Biomedical Resarch Foundaiton, Washington DC 
Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified Approach to Alignment and Phylogenes 
pp. 626-645 Methods in Enzymology vol. 183, Academic Press, Inc., San Diego, CA; 
Higgins, D.G. and Sharp, P.M. (1989) Fast and sensitive multiple sequence alignments on a 
microcomputer CABIOS 5\\5\-\53\ Myers, E.W. and Muller W. (1988) Optimal alignments 
in linear space CABIOS 4:UA1\ Robinson, E.D. (1971) Comb. Theor 77:105; Santou, N. 
Nes, M. (1987) The neighbor joining method. A new method for reconstructing phylogenetic 
trees Mol Biol Evol ^:406-425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical 
Taxonomy - the Principles and Practice of Numerical Taxonomy, Freeman Press, San 
Francisco, CA; Wilbur, W.J. and Lipman, D.J. (1983) Rapid similarity searches of nucleic 
acid and protein data banks Proc. Natl Acad., ScL USA 80:126-130, 

Preferably, the ''percentage of sequence identity" is determined by comparing 
two optimally aligned sequences over a window of comparison of at least 20 positions, 
wherein the portion of the polynucleotide sequence in the comparison window may comprise 
additions or deletions (i.e. gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12 
percent, as compared to the reference sequences (which does not comprise additions or 
deletions) for optimal aligrunent of the two sequences. The percentage is calculated by 



BNSDOCID: <WO 9933869A2J_> 



wo 99/33869 



PCT/US98/27416 



9 

determining the number of positions at which the identical nucleic acid bases or amino acid 
residue occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the reference sequence (i.e. 
the window size) and multiplying the results by 100 to yield the percentage of sequence 
identity. 

Also included in the scope of the present invention are alleles of the genes encoding 
the nucleotide sequences recited herein. As used herein, an "allele" or "allellic sequence" is 
an alternative form of the gene which may result from at least one mutation in the nucleic 
acid sequence. Alleles may result in altered mRNAs or polypeptides whose structure or 
function may or may not be altered. Any given gene may have none, one, or many allelic 
forms. Common mutational changes which give rise to alleles are generally ascribed to 
natural deletions, additions, or substitutions of nucleotides. Each of these types of changes 
may occur alone or in combination with the others, one or more times in a given sequence. 

For breast tumor polypeptides with immunoreactive properties, variants may, 
alternatively, be identified by modifying the amino acid sequence of one of the above 
polypeptides, and evaluating the immunoreactivity of the modified polypeptide. For breast 
tumor polypeptides useful for the generation of diagnostic binding agents, a variant may be 
identified by evaluating a modified polypeptide for the ability to generate antibodies that 
detect the presence or absence of breast cancer. Such modified sequences may be prepared 
and tested using, for example, the representative procedures described herein. 

The breast tumor proteins of the present invention, and polynucleotide 
molecules encoding such proteins, may be isolated from breast tumor tissue using any of a 
variety of methods well known in the art. Polynucleotide sequences corresponding to a gene 
(or a portion thereof) encoding one of the inventive breast tumor proteins may be isolated 
from a breast tumor cDNA library using a subtraction technique as described in detail below. 
Examples of such DNA sequences are provided in SEQ ID NOS: 1- 94. Partial 
polynucleotide sequences thus obtained may be used to design oligonucleotide primers for 
the amplification of full-length polynucleotide sequences in a polymerase chain reaction 
(PGR), using techniques well known in the art (see, for example, MuUis et a\,. Cold Spring 
Harbor Symp. QuanL Biol., 51:263, 1987; Erlich ed., PCR Technology, Stockton Press, NY, 
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1989). Once a polynucleotide sequence encoding a polypeptide is obtained, any of the above 
modifications may be readily introduced using standard mutagenesis techniques, such as 
oligonucleotide-directed site-specific mutagenesis as taught, for example, by Adelman et al. 
(i)A^^ 2:183, 1983). 

The breast tumor polypeptides disclosed herein may also be generated by 
synthetic or recombinant means. Synthetic polypeptides having fewer than about 100 amino 
acids, and generally fewer than about 50 amino acids, may be generated using techniques 
well known to those of ordinary skill in the art. For example, such polypeptides may be 
synthesized using any of the commercially available solid-phase techniques, such as the 
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain (see, for example, Merrifield, J. Am. Chem. Soc. 55:2149-2146, 
1963). Equipment for automated synthesis of polypeptides is commercially available from 
suppliers such as Perkin Elmer/ Applied BioSystems Division (Foster City, CA), and may be 
operated according to the manufacturers instructions. 

Alternatively, any of the above polypeptides may be produced recombinantly 
by inserting a polynucleotide sequence that encodes the polypeptide into an expression 
vector and expressing the protein in an appropriate host. Any of a variety of expression 
vectors known to those of ordinary skill in the art may be employed to express recombinant 
polypeptides of this invention. Expression may be achieved in any appropriate host cell that 
has been transformed or transfected with an expression vector containing a polynucleotide 
molecule that encodes a recombinant polypeptide. Suitable host cells include prokaryotes, 
yeast and higher eukaryotic cells. Preferably, the host cells employed are E. coli^ yeast or a 
mammalian cell line, such as CHO cells. The polynucleotide sequences expressed in this 
marmer may encode naturally occurring polypeptides, portions of naturally occurring 
polypeptides, or other variants thereof. 

In general, regardless of the method of preparation, the polypeptides disclosed 
herein are prepared in an isolated, substantially pure form (/.<?., the polypeptides are 
homogenous as determined by amino acid composition and primary sequence analysis). 
Preferably, the polypeptides are at least about 90% pure, more preferably at least about 95% 
pure arid most preferably at least about 99% pure. In certain preferred embodiments. 
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described in more detail below, the substantially pure polypeptides are incorporated into 
pharmaceutical compositions or vaccines for use in one or more of the methods disclosed 
herein. 

In a related aspect, the present invention provides fusion proteins comprising a 
first and a second inventive polypeptide or, alternatively, a polypeptide of the present 
invention and a known breast tumor antigen, together with variants of such fusion proteins. 

A polynucleotide sequence encoding a fusion protein of the present invention 
is constructed using knovm recombinant DNA techniques to assemble separate 
polynucleotide sequences encoding the first and second polypeptides into an appropriate 
expression vector. The 3' end of a polynucleotide sequence encoding the first polypeptide is 
ligated, with or without a peptide linker, to the 5' end of a polynucleotide sequence encoding 
the second polypeptide so that the reading frames of the sequences are in phase to permit 
mRNA translation of the two DNA sequences into a single fusion protein that retains the 
biological activity of both the first and the second polypeptides. 

A peptide linker sequence may be employed to separate the first and the 
second polypeptides by a distance sufficient to ensure that each polypeptide folds into its 
secondary and tertiary structures. Such a peptide linker sequence is incorporated into the 
fusion protein using standard techniques well known in the art. Suitable peptide linker 
sequences may be chosen based on the following factors: (1) their ability to adopt a flexible 
extended conformation; (2) their inability to adopt a secondary structure that could interact 
with functional epitopes on the first and second polypeptides; and (3) the lack of hydrophobic 
or charged residues that might react v^th the polypeptide functional epitopes. Preferred 
peptide linker sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, 
such as Thr and Ala may also be used in the linker sequence. Amino acid sequences which 
may be usefully employed as linkers include those disclosed in Maratea et al.. Gene 40:39-46, 
1985; Murphy et al., Proc, Nad Acad Sci. USA 53:8258-8262, 1986; U.S. Patent 
No. 4,935,233 and U.S. Patent No. 4,751,180. The hnker sequence may be from 1 to about 
50 amino acids in length. Peptide sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to separate 
the functional domains and prevent steric interference. 



BNSDOCID: <WO 9933869A2J_> 



wo 99/33869 



PCT/US98/27416 



12 

The ligated polynucleotide sequences are operably linked to suitable 
transcriptional or translational regulatory elements. The regulatory elements responsible for 
expression of polynucleotides are located only 5' to the polynucleotide sequence encoding 
the first polypeptides. Similarly, stop codons require to end translation and transcription 
termination signals are only present 3' to the polynucleotide sequence encoding the second 
polypeptide. 

Fusion proteins are also provided that comprise a polypeptide of the present 
invention together with an unrelated immunogenic protein. Preferabh' the immunogenic 
protein is capable of eliciting a recall response. Examples of such proteins include tetanus, 
tuberculosis and hepatitis proteins (see, for example, Stoute et al. New Engl J, Med,, 356:86- 
91 (1997)). 

Polypeptides of the present invention that comprise an immunogenic portion 
of a breast tumor protein may generally be used for immunotherapy of breast cancer, w^herein 
the polypeptide stimulates the patient's own immune response to breast tumor cells. In 
further aspects, the present invention provides methods for using one or more of the 
immunoreactive polypeptides encoded by a polynucleotide molecule having a sequence 
provided in SEQ ID NOS: 1- 94 (or fusion proteins comprising one or more such 
polypeptides and/or polynucleotides encoding such polypeptides) for immunotherapy of 
breast cancer in a patient. As used herein, a "patient" refers to any warm-blooded animal, 
preferably a human. A patient may be afflicted with a disease, or may be free of detectable 
disease. Accordingly, the above immunoreactive polypeptides (or fusion proteins or 
polynucleotide molecules encoding such polypeptides) may be used to treat breast cancer or 
to inhibit the development of breast cancer. The polypeptides may be administered either 
prior to or following surgical removal of primary tumors and/or treatment by administration 
of radiotherapy and conventional chemotherapeutic drugs. 

In these aspects, the polypeptide or fusion protein is generally present within a 
pharmaceutical composition and/or a vaccine. Pharmaceutical compositions may comprise 
one or more polypeptides, each of which may contain one or more of the above sequences (or 
variants thereof), and a physiologically acceptable carrier. The vaccines may comprise one or 
more of such polypeptides and a non-specific immune response enhancer, wherein the non- 
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specific immune response enhancer is capable of eliciting or enhancing an immune response 
to an exogenous antigen. Examples of non-specific-immune response enhancers include 
adjuvants, biodegradable microspheres (eg., polylactic galactide) and liposomes (into which 
the polypeptide is incorporated). Pharaiaceutical compositions and vaccines may also contain 
other epitopes of breast tumor antigens, either incorporated into a combination polypeptide 
{i.e., a single polypeptide that contains multiple epitopes) or present within a separate 
polypeptide. 

Alternatively, a pharmaceutical composition or vaccine may contain 
polynucleotides encoding one or more of the above polypeptides, such that the polypeptide is 
generated in situ. In such pharmaceutical compositions and vaccines, the polynucleotide 
may be present within any of a variety of delivery systems known to those of ordinary skill in 
the art, including nucleic acid expression systems, bacteria and viral expression systems. 
Appropriate nucleic acid expression systems contain the necessary polynucleotide sequences 
for expression in the patient (such as a suitable promoter). Bacterial delivery systems involve 
the administration of a bacterium (such as Bacillus-Calmette-Guerriri) that expresses an 
epitope of a breast tumor cell antigen on its cell surface. In a preferred embodiment, the 
polynucleotide molecules may be introduced using a viral expression system (e.g., vaccinia 
or other pox virus, retrovirus, or adenovirus), which may involve the use of a non-pathogenic 
(defective), replication competent virus. Suitable systems are disclosed, for example, in 
Fisher-Hoch et al., PNAS 5(5:317-321, 1989; Flexner et al., Ann, N.Y. Acad Sci. 5dP:86-103, 
1989; Flexner et al.. Vaccine (?:17-21, 1990; U.S. Patent Nos. 4,603,1 12, 4,769,330, and 
5,017,487; WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; 
WO 91/02805; Berkner, Biotechniques 6:616-627, 1988; Rosenfeld et al„ Science 
252:431-434, 1991; Kolls et al., PNAS Pi :2 15-2 19, 1994; Kass-Eisler et al., PNAS 
P0:1 1498-1 1502, 1993; Guzman et al.. Circulation 55:2838-2848, 1993; and Guzman et al., 
Cir. Res. 7J: 1202-1207, 1993. Techniques for incorporating polynucleotides into such 
expression systems are well knov^^i to those of ordinary skill in the art. The polynucleotides 
may also be "naked," as described, for example, in published PCT application WO 90/1 1092, 
and Ulmer et al.. Science 25P:1745-1749, 1993, reviewed by Cohen, Science 25P:1 691 -1692, 
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1993. The uptake of naked polynucleotides may be increased by coating the polynucleotides 
onto biodegradable beads, which are efficiently transported into the cells. 

Routes and frequency of administration, as well as dosage, will vary from 
individual to individual and may parallel those currently being used in immunotherapy of 
other diseases. In general, the pharmaceutical compositions and vaccines may be 
administered by injection {e.g,, intracutaneous, intramuscular, intravenous or subcutaneous), 
intranasally {e.g,, by aspiration) or orally. Between 1 and 10 doses may be administered over 
a 3-24 week period. Preferably, 4 doses are administered, at an interval of 3 months, and 
booster administrations may be given periodically thereafter. Alternate protocols may be 
appropriate for individual patients. A suitable dose is an amount of polypeptide or 
polynucleotide molecule that is effective to raise an immune response (cellular and/or 
humoral) against breast tumor cells in a treated patient. A suitable immune response is at 
least 10-50% above the basal (i.e., untreated) level. In general, the amount of polypeptide 
present in a dose (or produced in situ by the polynucleotide in a dose) ranges from about 1 pg 
to about 100 mg per kg of host, typically from about 10 pg to about 1 mg, and preferably 
from about 100 pg to about 1 ^g. Suitable dose sizes will vary with the size of the patient, 
but will typically range from about 0.01 mL to about 5 mL. 

While any suitable carrier known to those of ordinary skill in the art may be 
employed in the pharmaceutical compositions of this invention, the type of carrier will vary 
depending on the mode of administration. For parenteral administration, such as 
subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a lipid, a wax 
and/or a buffer. For oral administration, any of the above carriers or a solid carrier, such as 
mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, cellulose, glucose, 
sucrose, and/or magnesium carbonate, may be employed. Biodegradable microspheres (e.g., 
polylactic glycolide) may also be employed as carriers for the pharmaceutical compositions 
of this invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 
Patent Nos. 4,897,268 and 5,075,109. 

Any of a variety of non-specific immune response enhancers may be employed 
in the vaccines of this invention. For example, an adjuvant may be included. Most adjuvants 
contain a substance designed to protect the antigen from rapid catabolism, such as aluminum 



BNSDOCID: <WO 9933869A2J_> 



wo 99/33869 



15 



PCT/US98/27416 



hydroxide or mineral oil, and a nonspecific stimulator of immune response, such as lipid A, 
Bordello pertussis or Mycobacterium tuberculosis. Such adjuvants are commercially 
available as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco 
Laboratories, Detroit, MI) and Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ). 

Polypeptides disclosed herein may also be employed in adoptive 
immunotherapy for the treatment of cancer. Adoptive immunotherapy may be broadly 
classified into either active or passive immunotherapy. In active immunotherapy, treatment 
relies on the in vivo stimulation of the endogenous host immune system to react against 
tumors with the administration of immune response-modifying agents (for example, tumor 
vaccines, bacterial adjuvants, and/or cytokines). 

In passive immunotherapy, treatment involves the delivery of biologic 
reagents with established tumor-immune reactivity (such as effector cells or antibodies) that 
can directly or indirectly mediate antitumor effects and does not necessarily depend on an 
intact host immune system. Examples of effector cells include T lymphocytes (for example, 
CD8+ cytotoxic T-lymphocyte, CD4+ T-helper, tumor-infiltrating lymphocytes), killer cells 
(such as Natural Killer cells, lymphokine-activated killer cells), B cells, or antigen presenting 
cells (such as dendritic cells and macrophages) expressing the disclosed antigens. The 
polypeptides disclosed herein may also be used to generate antibodies or anti-idiotypic 
antibodies (as in U.S. Patent No. 4,918,164), for passive immunotherapy. 

The predominant method of procuring adequate numbers of T-cells for 
adoptive immunotherapy is to grow immune T-cells in vitro. Culture conditions for 
expanding single antigen-specific T-cells to several billion in number vdth retention of 
antigen recognition in vivo are well known in the art. These in vitro culture conditions 
typically utilize intermittent stimulation with antigen, often in the presence of cytokines, such 
as IL-2, and non-dividing feeder cells. As noted above, the immunoreactive polypeptides 
described herein may be used to rapidly expand antigen-specific T cell cultures in order to 
generate sufficient number of cells for immunotherapy. In particular, antigen-presenting 
cells, such as dendritic, macrophage or B-cells, may be pulsed with immunoreactive 
polypeptides or transfected with a polynucleotide sequence(s), using standard techniques well 
known in the art. For example, antigen presenting cells may be transfected with a 
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polynucleotide sequence, wherein said sequence contains a promoter region appropriate for 
increasing expression, and can be expressed as part of a recombinant virus or other expression 
system. For cuhured T-cells to be effective in therapy, the cultured T-cells must be able to 
grow and distribute widely and to survive long term in vivo. Studies have demonstrated that 
cultured T-cells can be induced to grow in vivo and to survive long term in substantial 
numbers by repeated stimulation with antigen supplemented vAth IL-2 (see, for example, 
Cheever, M., et al, "Therapy With Cultured T Cells: Principles Revisited, " Immunological 
Reviews, 157:177,1997). 

The polypeptides disclosed herein may also be employed to generate and/or 
isolate tumor-reactive T-cells, which can then be administered to the patient. In one 
technique, antigen-specific T-cell lines may be generated by in vivo irxmiunization with short 
peptides corresponding to immunogenic portions of the disclosed polypeptides. The resulting 
antigen specific CD8+ CTL clones may be isolated from the patient, expanded using standard 
tissue culture techniques, and returned to the patient. 

Alternatively, peptides corresponding to immunogenic portions of the 
polypeptides may be employed to generate tumor reactive T cell subsets by selective in vitro 
stimulation and expansion of autologous T cells to provide antigen-specific T cells which 
may be subsequently transferred to the patient as described, for example, by Chang et al. 
(CriL Rev. Oncol. Hematol, 22(3), 213, 1996). Cells of the immune system, such as T cells, 
may be isolated from the peripheral blood of a patient, using a commercially available cell 
separation system, such as CellPro Incorporated's (Bothell, WA) CEPRATE™ system (see 
U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 91/161 16 and WO 
92/07243). The separated cells are stimulated with one or more of the immunoreactive 
polypeptides contained within a delivery vehicle, such as a microsphere, to provide antigen- 
specific T cells. The population of tumor antigen-specific T cells is then expanded using 
standard techniques and the cells are administered back to the patient. 

In another embodiment, T-cell and/or antibody receptors specific for the 
polypeptides can be cloned, expanded, and transferred into other vectors or effector cells for 
use in adoptive immunotherapy. 

In a further embodiment, syngeneic or autologous dendritic cells may be 
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pulsed with peptides corresponding to at least an immunogenic portion of a polypeptide 
disclosed herein. The resulting antigen-specific dendritic cells may either be transferred into 
a patient, or employed to stimulate T cells to provide antigen-specific T cells which may, in 
turn, be administered to a patient. The use of peptide-pulsed dendritic cells to generate 
antigen-specific T cells and the subsequent use of such antigen-specific T cells to eradicate 
tumors in a murine model has been demonstrated by Cheever et al. Immunological Reviews, 
157'Ml, 1997). 

Additionally, vectors expressing the disclosed polynucleotides may be introduced into 
stem cells taken from the patient and clonally propagated in vitro for autologous transplant 
back into the same patient. 

Polypeptides of the present invention may also, or alternatively, be used to 
generate binding agents, such as antibodies or fragments thereof, that are capable of detecting 
metastatic human breast tumors. Binding agents of the present invention may generally be 
prepared using methods known to those of ordinary skill in the art, including the 
representative procedures described herein. Binding agents are capable of differentiating 
between patients with and without breast cancer, using the representative assays described 
herein. In other words, antibodies or other binding agents raised against a breast tumor 
protein, or a suitable portion thereof, will generate a signal indicating the presence of primary 
or metastatic breast cancer in at least about 20% of patients afflicted with the disease, and will 
generate a negative signal indicating the absence of the disease in at least about 90% of 
individuals without primary or metastatic breast cancer. Suitable portions of such breast 
tumor proteins are portions that are able to generate a binding agent that indicates the 
presence of primary or metastatic breast cancer in substantially all (/.^., at least about 80%, 
and preferably at least about 90%) of the patients for which breast cancer would be indicated 
using the full length protein, and that indicate the absence of breast cancer in substantially all 
of those samples that would be negative when tested with full length protein. The 
representative assays described below, such as the two-antibody sandwich assay, may 
generally be employed for evaluating the ability of a binding agent to detect metastatic human 
breast tumors. 
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The ability of a polypeptide prepared as described herein to generate 
antibodies capable of detecting primary or metastatic human breast tumors may generally be 
evaluated by raising one or more antibodies against the polypeptide (using, for example, a 
representative method described herein) and determining the ability of such antibodies to 
detect such tumors in patients. This determination may be made by assaying biological 
samples from patients with and without primary or metastatic breast cancer for the presence 
of a polypeptide that binds to the generated antibodies. Such test assays may be performed, 
for example, using a representative procedure described below. Polypeptides that generate 
antibodies capable of detecting at least 20% of primary or metastatic breast tumors by such 
procedures are considered to be useftil in assays for detecting primary or metastatic human 
breast tumors. Polypeptide specific antibodies may be used alone or in combination to 
improve sensitivity. 

Polypeptides capable of detecting primary or metastatic human breast tumors 
may be used as markers for diagnosing breast cancer or for monitoring disease progression in 
patients. In one embodiment, breast cancer in a patient may be diagnosed by evaluating a 
biological sample obtained from the patient for the level of one or more of the above 
polypeptides, relative to a predetermined cut-off value. As used herein, suitable "biological 
samples" include blood, sera and urine. 

The level of one or more of the above polypeptides may be evaluated using 
any binding agent specific for the polypeptide(s). A "binding agent," in the context of this 
invention, is any agent (such as a compound or a cell) that binds to a polypeptide as described 
above. As used herein, "binding" refers to a noncovalent association between two separate 
molecules (each of which may be free (i.e., in solution) or present on the surface of a cell or a 
solid support), such that a "complex" is formed. Such a complex may be free or immobilized 
(either covalently or noncovalently) on a support material. The ability to bind may generally 
be evaluated by determining a binding constant for the formation of the complex. The 
binding constant is the value obtained when the concentration of the complex is divided by 
the product of the component concentrations. In general, two compounds are said to "bind" 
in the context of the present invention when the binding constant for complex formation 
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exceeds about lO^L/mol. The binding constant may be determined using methods well 
known to those of ordinary skill in the art. 

Any agent that satisfies the above requirements may be a binding agent. For 
example, a binding agent may be a ribosome with or without a peptide component, an RNA 
molecule or a peptide. In a preferred embodiment, the binding partner is an antibody, or a 
fragment thereof. Such antibodies may be polyclonal, or monoclonal. In addition, the 
antibodies may be single chain, chimeric, CDR-grafted or humanized. Antibodies may be 
prepared by the methods described herein and by other methods well known to those of skill 
in the art. 

There are a variety of assay formats known to those of ordinary skill in the art 
for using a binding partner to detect polypeptide markers in a sample. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In a 
preferred embodiment, the assay involves the use of binding partner immobilized on a solid 
support to bind to and remove the polypeptide from the remainder of the sample. The bound 
polypeptide may then be detected using a second binding partner that contains a reporter 
group. Suitable second binding partners include antibodies that bind to the binding 
partner/polypeptide complex. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized binding 
partner after incubation of the binding partner with the sample. The extent to which 
components of the sample inhibit the binding of the labeled polypeptide to the binding 
partner is indicative of the reactivity of the sample with the immobilized binding partner. 

The solid support may be any material known to those of ordinary skill in the 
art to which the antigen may be attached. For example, the solid support may be a test well in 
a microtiter plate or a nitrocellulose or other suitable membrane. Altematively, the support 
may be a bead or disc, such as glass, fiberglass, latex or a plastic material such as polystyrene 
or polyvinylchloride. The support may also be a magnetic particle or a fiber optic sensor, 
such as those disclosed, for example, in U.S. Patent No. 5,359,681. The binding agent may 
be immobilized on the solid support using a variety of techniques known to those of skill in 
the art, which are amply described in the patent and scientific literature. In the context of the 
present invention, the term "immobilization*' refers to both noncovalent association, such as 
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adsorption, and covalent attachment (which may be a direct linkage between the antigen and 
functional groups on the support or may be a linkage by way of a cross-linking agent). 
Immobilization by adsorption to a well in a microtiter plate or to a membrane is preferred. In 
such cases, adsorption may be achieved by contacting the binding agent, in a suitable buffer, 
with the solid support for a suitable amount of time. The comact time varies with 
temperature, but is typically between about 1 hour and about 1 day. In general, contacting a 
well of a plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of 
binding agent ranging from about 10 ng to about 10 ^g, and preferably about 100 ng to about 
1 ^g, is sufficient to immobilize an adequate amount of binding agent. 

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a bifimctional reagent that will react with both the 
support and a functional group, such as a hydroxyl or amino group, on the binding agent. For 
example, the binding agent may be covalently attached to supports having an appropriate 
polymer coating using benzoquinone or by condensation of an aldehyde group on the support 
with an amine and an active hydrogen on the binding partner (see, e.g.. Pierce 
Immunotechnology Catalog and Handbook, 1991, at AI2-A13). 

In certain embodiments, the assay is a two-antibody sandwich assay. This 
assay may be performed by first contacting an antibody that has been immobilized on a solid 
support, commonly the well of a microtiter plate, with the sample, such that polypeptides 
within the sample are allowed to bind to the immobilized antibody. Unbound sample is then 
removed from the immobilized polypeptide-antibody complexes and a second antibody 
(containing a reporter group) capable of binding to a different site on the polypeptide is 
added. The amount of second antibody that remains bound to the solid support is then 
determined using a method appropriate for the specific reporter group. 

More specifically, once the antibody is immobilized on the support as 
described above, the remaining protein binding sites on the support are typically blocked. 
Any suitable blocking agent known to those of ordinary skill in the art, such as bovine serum 
albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The immobilized antibody is 
then incubated with the sample, and polypeptide is allowed to bind to the antibody. The 
sample may be diluted with a suitable diluent, such as phosphate-buffered saline (PBS) prior 
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to incubation. In general, an appropriate contact time (re., incubation time) is that period of 
time that is sufficient to detect the presence of polypeptide within a sample obtained from an 
individual with breast cancer. Preferably, the contact time is sufficient to achieve a level of 
binding that is at least about 95% of that achieved at equilibrium between bound and unbound 
polypeptide. Those of ordinary skill in the art will recognize that the time necessary to 
achieve equilibrium may be readily determined by assaying the level of binding that occurs 
over a period of time. At room temperature, an incubation time of about 30 minutes is 
generally sufficient. 

Unbound sample may then be removed by washing the solid support with an 
appropriate buffer, such as PBS containing 0.1% Tween 20™. The second antibody, which 
contains a reporter group, may then be added to the solid support. Preferred reporter groups 
include enzymes (such as horseradish peroxidase), substrates, cofactors, inhibitors, dyes, 
radionuclides, luminescent groups, fluorescent groups and biotin. The conjugation of 
antibody to reporter group may be achieved using standard methods known to those of 
ordinary skill in the art. 

The second antibody is then incubated with the immobilized antibody- 
polypeptide complex for an amount of time sufficient to detect the bound polypeptide. An 
appropriate amount of time may generally be determined by assaying the level of binding that 
occurs over a period of time. Unbound second antibody is then removed and bound second 
antibody is detected using the reporter group. The method employed for detecting the 
reporter group depends upon the nature of the reporter group. For radioactive groups, 
scintillation counting or autoradiographic methods are generally appropriate. Spectroscopic 
methods may be used to detect dyes, luminescent groups and fluorescent groups. Biotin may 
be detected using avidin, coupled to a different reporter group (commonly a radioactive or 
fluorescent group or an enzyme). Enzyme reporter groups may generally be detected by the 
addition of substrate (generally for a specific period of time), followed by spectroscopic or 
other analysis of the reaction products. 

To determine the presence or absence of breast cancer, the signal detected 
from the reporter group that remains bound to the solid support is generally compared to a 
signal that corresponds to a predetermined cut-off value. In one preferred embodiment, the 
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cut-off value is the average mean signal obtained when the immobilized antibody is incubated 
with samples from patients without breast cancer. In general, a sample generating a signal 
that is three standard deviations above the predetermined cut-off value is considered positive 
for breast cancer. In an alternate preferred embodiment, the cut-off value is determined using 
a Receiver Operator Curve, according to the method of Sackett at al., Clinical Epidemiology: 
A Basic Science for Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this 
embodiment, the cut-off value may be determined from a plot of pairs of true positive rates 
{i.e., sensitivity) and false positive rates (100%-specificity) that correspond to each possible 
cut-off value for the diagnostic test resuh. The cut-off value on the plot that is the closest to 
the upper left-hand comer {i.e., the value that encloses the largest area) is the most accurate 
cut-off value, and a sample generating a signal that is higher than the cut-off value 
determined by this method may be considered positive. Alternatively, the cut-off value may 
be shifted to the left along the plot, to minimize the false positive rate, or to the right, to 
minimize the false negative rate. In general, a sample generating a signal that is higher than 
the cut-off value determined by this method is considered positive for breast cancer. 

In a related embodiment, the assay is performed in a flow-through or strip test 
format, wherein the antibody is immobilized on a membrane, such as nitrocellulose. In the 
flow-through test, polypeptides within the sample bind to the immobilized antibody as the 
sample passes through the membrane. A second, labeled antibody then binds to the antibody- 
polypeptide complex as a solution containing the second antibody flows through the 
membrane. The detection of bound second antibody may then be performed as described 
above. In the strip test format, one end of the membrane to which antibody is bound is 
immersed in a solution containing the sample. The sample migrates along the membrane 
through a region containing second antibody and to the area of immobilized antibody. 
Concentration of second antibody at the area of immobilized antibody indicates the presence 
of breast cancer. Typically, the concentration of second antibody at that site generates a 
pattern, such as a line, that can be read visually. The absence of such a pattern indicates a 
negative result. In general, the amount of antibody immobilized on the membrane is selected 
to generate a visually discernible pattern when the biological sample contains a level of 
polypeptide that would be sufficient to generate a positive signal in the two-antibody 
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sandwich assay, in the format discussed above. Preferably, the amount of antibody 
immobilized on the membrane ranges from about 25 ng to about 1 ^g, and more preferably 
from about 50 ng to about 500 ng. Such tests can typically be performed with a very small 
amount of biological sample. 

Of course, numerous other assay protocols exist that are suitable for use with 
the antigens or antibodies of the present invention. The above descriptions are intended to be 
exemplary only. 

In another embodiment, the above polypeptides may be used as markers for 
the progression of breast cancer. In this embodiment, assays as described above for the 
diagnosis of breast cancer may be performed over time, and the change in the level of reactive 
polypeptide(s) evaluated. For example, the assays may be performed every 24-72 hours for a 
period of 6 months to 1 year, and thereafter performed as needed. In general, breast cancer is 
progressing in those patients in whom the level of polypeptide detected by the binding agent 
increases over time. In contrast, breast cancer is not progressing when the level of reactive 
polypeptide either remains constant or decreases with time. 

Antibodies for use in the above methods may be prepared by any of a variety 
of techniques known to those of ordinary skill in the art. See, e.g,, Harlow and Lane, 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In one such 
technique, an immunogen comprising the antigenic polypeptide is initially injected into any 
of a wide variety of mammals (e.g., mice, rats, rabbits, sheep and goats). In this step, the 
polypeptides of this invention may serve as the immunogen without modification. 
Alternatively, particularly for relatively short polypeptides, a superior immune response may 
be elicited if the polypeptide is joined to a carrier protein, such as bovine serum albumin or 
keyhole limpet hemocyanin. The immunogen is injected into the animal host, preferably 
according to a predetermined schedule incorporating one or more booster immunizations, and 
the animals are bled periodically. Polyclonal antibodies specific for the polypeptide may then 
be purified from such antisera by, for example, affinity chromatography using the polypeptide 
coupled to a suitable solid support. 

Monoclonal antibodies specific for the antigenic polypeptide of interest may 
be prepared, for example, using the technique of Kohler and Milstein, Eur, J. Immunol 
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<5:51 1-519, 1976, and improvements thereto. Briefly, these methods involve the preparation 
of immortal cell lines capable of producing antibodies having the desired specificity (i,e., 
reactivity with the polypeptide of interest). Such cell lines may be produced, for example, 
from spleen cells obtained from an animal immunized as described above. The spleen cells 
are then immortalized by, for example, fusion with a myeloma cell fusion partner, preferably 
one that is syngeneic with the immunized animal. A variety of fusion techniques may be 
employed. For example, the spleen cells and myeloma cells may be combined with a 
nonionic detergent for a few minutes and then plated at low density on a selective medium 
that supports the growth of hybrid cells, but not myeloma cells. A preferred selection 
technique uses HAT (hypoxanthine, aminopterin, thymidine) selection. After a sufficient 
time, usually about 1 to 2 weeks, colonies of hybrids are observed. Single colonies are 
selected and tested for binding activity against the polypeptide. Hybridomas having high 
reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supematants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the yield, 
such as injection of the hybridoma cell line into the peritoneal cavity of a suitable vertebrate 
host, such as a mouse. Monoclonal antibodies may then be harvested from the ascites fluid or 
the blood. Contaminants may be removed from the antibodies by conventional techniques, 
such as chromatography, gel filtration, precipitation, and extraction. The polypeptides of this 
invention may be used in the purification process in, for example, an affinity chromatography 
step. 

Monoclonal antibodies of the present invention may also be used as 
therapeutic reagents, to diminish or eliminate breast tumors. The antibodies may be used on 
their own (for instance, to inhibit metastases) or coupled to one or more therapeutic agents. 
Suitable agents in this regard include radionuclides, differentiation inducers, drugs, toxins, 
and derivatives thereof Preferred radionuclides include ^°Y, '^X '^^Re, **^Re, ^'^At, 

and ^'^Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred toxins 
include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella 
toxin, and pokeweed antiviral protein. 
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A therapeutic agent may be coupled (e.g., covalently bonded) to a suitable 
monoclonal antibody either directly or indirectly (e.g., via a linker group). A direct reaction 
between an agent and an antibody is possible when each possesses a substituent capable of 
reacting with the other. For example, a nucleophilic group, such as an amino or sulfhydryl 
group, on one may be capable of reacting with a carbonyl-containing group, such as an 
anhydride or an acid halide, or with an alkyl group containing a good leaving group (e.g., a 
halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an 
antibody via a linker group. A linker group can function as a spacer to distance an antibody 
from an agent in order to avoid interference with binding capabilities. A linker group can 
also serve to increase the chemical reactivity of a substituent on an agent or an antibody, and 
thus increase the coupling efficiency. An increase in chemical reactivity may also facilitate 
the use of agents, or functional groups on agents, which otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of bifunctional or 
polyfiinctional reagents, both homo- and hetero-functional (such as those described in the 
catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker group. 
CoupUng may be effected, for example, through amino groups, carboxyl groups, sulfhydryl 
groups or oxidized carbohydrate residues. There are numerous references describing such 
methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al. 

Where a therapeutic agent is more potent when free from the antibody portion 
of the immunoconjugates of the present invention, it may be desirable to use a linker group 
which is cleavable during or upon internalization into a cell. A number of different cleavable 
linker groups have been described. The mechanisms for the intracellular release of an agent 
from these linker groups include cleavage by reduction of a disulfide bond (e.g., U.S. Patent 
No. 4,489,710, to Spitler), by irradiation of a photolabile bond (e.g., U.S. Patent 
No. 4,625,014, to Senter et al.), by hydrolysis of derivatized amino acid side chains (e.g., U.S. 
Patent No. 4,638,045, to Kohn et al.), by serum complement-mediated hydrolysis (e.g., U.S. 
Patent No. 4,671,958, to Rodwell etal.), and acid-catalyzed hydrolysis (e.g., U.S. Patent 
No. 4,569,789, to Blattler et al.). 
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It may be desirable to couple more than one agent to an antibody. In one 
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agem may 
be prepared in a variety of ways. For example, more than one agent may be coupled directly 
to an antibody molecule, or linkers which provide multiple sites for attachment can be used. 
Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent bonding 
either directly or via a linker group. Suitable carriers include proteins such as albumins {e.g., 
U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides such as aminodextran' 
{e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may also bear an agent by 
noncovalent bonding or by encapsulation, such as within a liposome vesicle (e.g., U.S. Patent 
Nos. 4,429,008 and 4,873,088). Carriers specific for radionuclide agJnts include 
radiohalogenated small molecules and chelating compounds. For example, U.S. Patent No. 
4,735,792 discloses representative radiohalogenated small molecules and their synthesis. A 
radionuclide chelate may be formed from chelating compounds that include those containing 
nitrogen and sulfur atoms as the donor atoms for binding the metal, or metal oxide, 
radionuclide. For example, U.S. Patent No. 4,673,562, to Davison etal. discloses 
representative chelating compounds and their synthesis. 

A variety of routes of administration for the antibodies and immunoconjugates 
may be used. Typically, administration will be intravenous, intramuscular, subcutaneous or 
in the bed of a resected tumor. It will be evident that the precise dose of the 
antibody/immunoconjugate will vary depending upon the antibody used, the antigen density 
on the tumor, and the rate of clearance of the antibody. 

Diagnostic reagents of the present invention may also comprise polynucleotide 
sequences encoding one or more of the above polypeptides, or one or more portions thereof 
For example, at least two oligonucleotide primers may be employed in a polymerase chain 
reaction (PGR) based assay to amplify breast tumor-specific cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for a DNA molecule 
encoding a breast tumor protein of the present invention. The presence of the amplified 
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cDNA is then detected using techniques well known in the art, such as gel electrophoresis. 
Similarly, oligonucleotide probes specific for a DNA molecule encoding a breast tumor 
protein of the present invention may be used in a hybridization assay to detect the presence of 
an inventive polypeptide in a biological sample. 

As used herein, the term "oligonucleotide primer/probe specific for a DNA 
molecule" means an oligonucleotide sequence that has at least about 60%, preferably at least 
about 75% and more preferably at least about 90%, identity to the DNA molecule in question. 
Oligonucleotide primers and/or probes which may be usefully employed in the inventive 
diagnostic methods preferably have at least about 10-40 nucleotides. In a preferred 
embodiment, the oHgonucleotide primers comprise at least about 10 contiguous nucleotides 
of a DNA molecule having a partial sequence selected from SEQ ID NOS: 1- 94. Preferably, 
oligonucleotide probes for use in the inventive diagnostic methods comprise at least about 15 
contiguous oligonucleotides of a DNA molecule having a partial sequence provided in SEQ 
ID NOS: 1- 94. Techniques for both PGR based assays and hybridization assays are well 
known in the art (see, for example, Mullis et al. Ibid; Ehrlich, Ibid), Primers or probes may 
thus be used to detect breast tumor-specific sequences in biological samples, including blood, 
urine and/or breast tumor tissue. 

The following Examples are offered by way of illustration and not by way of 

limitation. 



EXAMPLES 
Example 1 

ISOLATION AND CHARACTERIZATION OF BREAST 
TUMOR POLYPEPTIDES 

This Example describes the isolation of breast tumor polypeptides from a 
breast tumor cDN A library. 

A human breast tumor cDNA expression library was constructed from a pool 
of breast tumor poly A" RNA from three patients using a Superscript Plasmid System for 
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cDNA Synthesis and Plasmid Cloning kit (BRL Life Technologies, Gaithersburg, MD 20897) 
following the manufacturers protocol. Specifically, breast tumor tissues were homogenized 
with polytron (Kinematica, Switzerland) andHotal RNA was extracted using Trizol reagent 
(BRL Life Technologies) as directed by the manufacturer. The poly A"" RNA was then 
purified using a Qiagen oligotex spin column mRNA purification kit (Qiagen, Santa Clarita, 
CA 91355) according to the manufacturer's protocol. First-strand cDNA was synthesized 
using the NotI/01igo-dT18 primer. Double-stranded cDNA was synthesized, ligated with 
EcoRI/BstX I adaptors (Invitrogen, Carlsbad, CA) and digested with NotL Following size 
fractionation with Chroma Spin-1000 columns (Clontech, Palo Alto, CA 94303), the cDNA 
was ligated into the EcoRI/NotI site of pCDNA3.1 (Invitrogen, Carlsbad, CA) and 
transformed into ElectroMax E. coli DHIOB cells (BRL Life Technologies) by 
electroporation. 

Using the same procedure, a normal human breast cDNA expression library 
was prepared from a pool of four normal breast tissue specimens. The cDNA libraries were 
characterized by determining the number of independent colonies, the percentage of clones 
that carried insert, the average insert size and by sequence analysis. The breast tumor library 
contained 1.14 x 10^ independent colonies, with more than 90% of clones having a visible 
insert and the average insert size being 936 base pairs. The normal breast cDNA library 
contained 6x10^ independent colonies, with 83% of clones having inserts and the average 
insert size being 1015 base pairs. Sequencing analysis showed both libraries to contain good 
complex cDNA clones that were synthesized from mRNA, with minimal rRNA and 
mitochondrial DNA contamination sequencing. 

cDNA library subtraction was performed using the above breast tumor and 
normal breast cDNA libraries, as described by Hara et al, {Blood, 5^:189-199, 1994) with 
some modifications. Specifically, a breast tumor-specific subtracted cDNA library was 
generated as follows. Normal breast cDNA library (70 |ag) was digested with EcoRI, NotI, 
and Sful, followed by a filling-in reaction with DNA polymerase Klenow fragment. After 
phenol-chloroform extraction and ethanol precipitation, the DNA was dissolved in 1 00 jil of 
H2O, heat-denatured and mixed with 100 jal (100 jag) of Photoprobe biotin (Vector 
Laboratories, Burlingame, CA), the resulting mixture was irradiated with a 270 W sunlamp 
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on ice for 20 minutes. Additional Photoprobe biotin (50 was added and the biotinylation 
reaction was repeated. After extraction with butanol five times, the DNA was ethanol- 
precipitated and dissolved in 23 \x\ H2O to form the driver DNA. 

To form the tracer DNA, 10 )ig breast tumor cDNA library was digested with 
BamHI and Xhol, phenol chloroform extracted and passed through Chroma spin-400 columns 
(Clontech). Following ethanol precipitation, the tracer DNA was dissolved in 5 ^1 HjO. 
Tracer DNA was mixed with 15 |al driver DNA and 20 jj.1 of 2 x hybridization buffer (1.5 M 
NaCl/10 mM EDTA/50 mM HEPES pH 7.5/0.2% sodium dodecyl sulfate), overlaid with 
mineral oil, and heat-denatured completely. The sample was immediately transferred into a 
68 water bath and incubated for 20 hours (long hybridization [LH]). The reaction mixture 
was then subjected to a streptavidin treatment followed by phenol/chloroform extraction. 
This process was repeated three more times. Subtracted DNA was precipitated, dissolved in 
12 III H2O, mixed with 8 |j.1 driver DNA and 20 |j.1 of 2 x hybridization buffer, and subjected 
to a hybridization at 68 for 2 hours (short hybridization [SH]). After removal of 
biotinylated double-stranded DNA, subtracted cDNA was ligated into BamHI/XhoI site of 
chloramphenicol resistant pBCSK^ (Stratagene, La Jolla, CA 92037) and transformed into 
ElectroMax E, coli DHIOB cells by electroporation to generate a breast tumor specific 
subtracted cDNA library. 

To analyze the subtracted cDNA library, plasmid DNA was prepared from 100 
independent clones, randomly picked from the subtracted breast tumor specific library and 
characterized by DNA sequencing with a Perkin Elmer/Applied Biosystems Division 
Automated Sequencer Model 373A (Foster City, CA). Thirty-eight distinct cDNA clones 
were fovmd in the subtracted breast tumor-specific cDNA library. The determined 3' cDNA 
sequences for 14 of these clones are provided in SEQ ID NO: 1-14, with the corresponding 
5' cDNA sequences being provided in SEQ ID NO: 15-28, respectively. The determined one 
strand (5' or 3') cDNA sequences for the remaining clones are provided in SEQ ID NO: 29- 
52. Comparison of these cDNA sequences with known sequences in the gene bank using the 
EMBL and GenBank databases (Release 97) revealed no significant homologies to the 
sequences provided in SEQ ID NO: 3, 10, 17, 24 and 45-52. The sequences provided in SEQ 
ID NO: 1, 2, 4-9, 1 1-16, 1 8-23, 25-41 , 43 and 44 were found to show at least some degree of 
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homology to known human genes. The sequence of SEQ ID NO: 42 was found to show 
some homology to a known yeast gene. 

Data was analyzed using Synteni provided GEMTOOLS Software. Twenty 
one distinct cDNA clones were found to be over-expressed in breast tumor and expressed at 
low levels in all normal tissues tested. The determined partial cDNA sequences for these 
clones are provided in SEQ ID NO: 53- 73. Comparison of the sequences of SEQ ID NO: 
53, 54, and 68-71 with those in the gene bank as described above, revealed some homology to 
previously identified human genes. No significant homologies were found to the sequences 
of SEQ ID NO: 55-67, 72 (referred to as JJ 9434,71 17), and 73 (referred to as B535S). 

In a further experiment, cDNA fragments analyzed by DNA microarray were 
obtained from two subtraction libraries derived by conventional subtraction, as described 
above. In one instance the tester was derived from primary breast tumors. In the second 
instance, a metastatic breast tumor was employed as the tester. Drivers consisted of normal 
breast. 

cDNA fragments from these two libraries were submitted as templates for 
DNA microarray analysis. DNA chips were analyzed by hybridizing with fluorescent probes 
derived from mRNA from both tumor and normal tissues. Analysis of the data was 
accomplished by creating three groups from the sets of probes. The composition of these 
probe groups, referred to as Breast Tumor/mets, Normal non-breast tissues, and Metastatic 
breast ttimors. Two comparisons were performed using the modified Gemtools analysis. The 
first comparison was to identify templates with elevated expression in breast tumors. The 
second was to identify templates not recovered in the first comparison that yielded elevated 
expression in metastatic breast tumors. An arbitrary level of increased expression (mean of 
tumor expression versus the mean of normal tissue expression) was set at approximately 2.2. 

In the first round of comparison to identify overexpression in breast tumors, 
two novel gene sequences were identified, hereinafter referred to as B534S and B538S (SEQ 
ID NO: 89 and 90), and six sequences that showed some degree of homology to previously 
identified genes (SEQ ID NO: 74-79). Additionally, in a second comparison to identify 
elevated expression in metastatic breast tumors, five novel sequences were identified, 
hereinafter referred to as B535S (overexpressed in this analysis as well as what was described 
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above), B542S, B543S, P501S, and B541S (SEQ ID NO: 73, and 91-94), as well as nine gene 
sequences that showed some homology to known genes (SEQ ID NO: 80-88). Clone B534S 
and B538S (SEQ ID NO: 89 and 90) were shown to be overexpressed in both breast tumors 
and metastatic breast tumors. 

Example 2 

GENERATION OF HUMAN CD 8+ CYTOTOXIC T-CELLS THAT RECOGNIZE 
ANTIGEN PRESENTING CELLS EXPRESSING BREAST TUMOR ANTIGENS 

This Example illustrates the generation of T cells that recognize target cells 
expressing the antigen B51 IS, also known as 1016-F8 (SEQ ID NO: 56). Human CD8+ T 
cells were primed in-vitro to the B511S gene product using dendritic cells infected with a 
recombinant vaccinia virus engineered to express B511S as follows (also see Yee et al.. 
Journal of Inmiunology (1996) 157 (9):4079-86). Dendritic cells (DC) were generated from 
peripheral blood derived monocytes by differentiation for 5 days in the presence of 50 |j.g/ml 
GMCSF and 30 \xg/m\ IL-4. DC were harvested, plated in wells of a 24-weIl plate at a 
density of 2 x 10^ cells/well and infected for 12 hours with B511S expressing vaccinia at a 
multiplicity of infection of 5. DC were then matured overnight by the addition of 3 (ag/ml 
CD40-Ligand and UV irradiated at 100|iW for 10 minutes. CD8+ T cells were isolated using 
magnetic beads, and priming cultures were initiated in individual wells (typically in 24 wells 
of a 24-well plate) using 7x10^ CD8+ T cells and 1 x 10^ irradiated CD8-depleted PBMC; 
IL-7 at 10 ng/ml was added to cultures at day 1. Cultures were re-stimulated every 7-10 days 
using autologous primary fibroblasts retrovirally transduced with B511S and the 
costimulatory molecule B7.1. Cultures were supplemented at day 1 with 15 LU. of IL-2. 
Following 4 such stimulation cycles, CD8+ cultures were tested for their ability to 
specifically recognize autologous fibroblasts transduced with B511S using an interferon-y 
Elispot assay (see Lalvani et al J. Experimental Medicine (1997) 186:859-965). Briefly, T 
cells from individual microcultures were added to 96-well Elispot plates that contained 
autologous fibroblasts transduced to express either B511S or as a negative control antigen 
EGFP, and incubated overnight at 37° C; wells also contained IL-12 at 10 ng/ml. Cultures 
were identified that specifically produced interferon-y only in response to B51 IS transduced 
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fibroblasts; such lines were further expanded and also cloned by limiting dilution on 
autologous B-LCL retrovirally transduced with B51 IS. Lines and clones were identified that 
could specifically recognize autologous B-LCL transduced with B511S but not autologous B- 
LCL transduced with the control antigens EGFP or HLA-A3. An example demonstrating the 
ability of human CTL cell lines derived from such experiments to specifically recognize and 
lyse B5 1 1 S expressing targets is presented in Figure 1 . 

Example 3 
SYNTHESIS OF POLYPEPTIDES 

Polypeptides may be synthesized on an Perkin Elmer/Applied Biosystems 
Division 430A peptide synthesizer using FMOC chemistry with HPTU (O-Benzotriazole- 
N,N,N*,N'-tetramethyluronium hexafluorophosphate) activation. A Gly-Cys-Gly sequence 
may be attached to the amino terminus of the peptide to provide a method of conjugation, 
binding to an immobilized surface, or labeling of the peptide. Cleavage of the peptides from 
the solid support may be carried out using the following cleavage mixture: trifluoroacetic 
acid:ethanedithiol:thioanisole:water:phenol (40:1:2:2:3). After cleaving for 2 hours, the 
peptides may be precipitated in cold methyl-t-butyl-ether. The peptide pellets may then be 
dissolved in water containing 0.1% trifluoroacetic acid (TFA) and lyophilized prior to 
purification by CI 8 reverse phase HPLC. A gradient of 0%-60yo acetonitrile (containing 
0.1% TFA) in water (containing 0.1% TFA) may be used to elute the peptides. Following 
lyophilization of the pure fractions, the peptides may be characterized using electrospray or 
other types of mass spectrometry and by amino acid analysis. 

From the foregoing, it will be appreciated that, although specific embodiments 
of the invention have been described herein for the purposes of illustration, various 
modifications may be made without deviating from the spirit and scope of the invention. 
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CLAIMS 

1. An isolated polypeptide comprising an immunogenic portion of a 
breast protein or a variant of said protein that differs only in conservative substitutions and/or 
modifications, v^herein said protein comprises an amino acid sequence encoded by a 
polynucleotide molecule comprising a sequence selected from the group consisting of: (a) 
nucleotide sequences recited in SEQ ID NOS: 3, 10, 17, 24, 45-52, 55-67, 72, 73, and 89-94; 
(b) complements of said nucleotide sequences; and (c) sequences that hybridize to a sequence 
of (a) or (b) under moderately stringent conditions. 

2. An isolated polynucleotide molecule comprising a nucleotide sequence 
encoding the polypeptide of claim 1 . 

3. An isolated polynucleotide molecule comprising a sequence provided 
in SEQ ID NOS: 3, 10, 17, 24, 45-52, 55-67, 72, 73, and 89-94. 

4. An expression vector comprising a polynucleotide molecule according 
to any one of claims 2 and 3. 

5. A host cell transformed with the expression vector of claim 4. 

6. The host cell of claim 5 wherein the host cell is selected from the group 
consisting of E. coli, yeast and mammalian cell lines. 

7. A pharmaceutical composition comprising the polypeptide of claim 1 
and a physiologically acceptable carrier. 

8. A vaccine comprising the polypeptide of claim 1 and a non-specific 
immune response enhancer. 
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9. The vaccine of claim 8 wherein the non-specific immune response 
enhancer is an adjuvant. 

10. A vaccine comprising a polynucleotide molecule of any one of claims 
2 and 3 and a non-specific immune response enhancer. 

11. The vaccine of claim 10 wherein the non-specific immune response 
enhancer is an adjuvant. 

12. A pharmaceutical composition for the treatment of breast cancer 
comprising a polypeptide and a physiologically acceptable carrier, the polypeptide 
comprising an immimogenic portion of a breast protein, wherein said protein comprises an 
amino acid sequence encoded by a polynucleotide molecule comprising a sequence selected 
fi-om the group consisting of: (a) nucleotide sequences recited in SEQ ID NOS: 1, 2, 4-9, 1 1- 
16, 18-23, 25-44, 53, 54, 68-71, and 74-88; (b) complements of said nucleotide sequences; 
and (c) sequences that hybridize to a sequence of (a) or (b) under moderately stringent 
conditions. 

13. A vaccine for the treatment of breast cancer comprising a polypeptide 
and a non-specific immune response enhancer, said polypeptide comprising an immunogenic 
portion of a breast protein, wherein said protein comprises an amino acid sequence encoded 
by a polynucleotide molecule comprising a sequence selected from the group consisting of: 
(a) nucleotide sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68- 
71, and 74-88; (b) complements of said nucleotide sequences; and (c) sequences that 
hybridize to a sequence of (a) or (b) under moderately stringent conditions. 

14. The vaccine of claim 13 wherein the non-specific immtrne response 
enhancer is an adjuvant. 

15. A vaccine for the treatment of breast cancer comprising a 
polynucleotide molecule and a non-specific immune response enhancer, the polynucleotide 
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molecule comprising a sequence selected from the group consisting of: (a) nucleotide 
sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88; 
(b) complements of said nucleotide sequences; and (c) sequences that hybridize to a sequence 
of (a) or (b) under moderately stringent conditions. 

16. The vaccine of claim 15, wherein the non-specific immune response 
enhancer is an adjuvant. 

17. A pharmaceutical composition according to claims 7 or 12, for use in 
the manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

18. A vaccine according to any one of claims 8, 10, 13 or 15, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

19. A fusion protein comprising at least one polypeptide according to 

claim 1. 

20. A pharmaceutical composition comprising a fusion protein according 
to claim 19 and a physiologically acceptable carrier. 

21. A vaccine comprising a fusion protein according to claim 19 and a 
non-specific immune response enhancer. 

22. The vaccine of claim 21 wherein the non-specific immune response 
enhancer is an adjuvant. 

23. A pharmaceutical composition according to claim 20, for use in 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

24. A vaccine according to claim 21 , for use in the manufacture of a 
medicament for inhibiting the development of breast cancer in a patient. 



BNSDOCID: «WO 9933869A2_L> 



wo 99/33869 



36 



PCTAJS98/27416 



25. A method for detecting breast cancer in a patient, comprising: 

(a) contacting a biological sample from a patient with a binding agent 
which is capable of binding to a polypeptide, the polypeptide comprising an immunogenic 
portion of a breast protein, wherein said protein comprises an amino acid sequence encoded 
by a polynucleotide molecule comprising a sequence selected from the group consisting of 
nucleotide sequences recited in SEQ ID NOS: 1-94. complements of said nucleotide 
sequences and sequences that hybridize to a sequence provided in SEQ ID NO: 1-94 under 
moderately stringent conditions; and 

(b) detecting in the sample a protein or polypeptide that binds to the 
binding agent, thereby detecting breast cancer in the patient. 

26. The method of claim 25 wherein the binding agent is a 
monoclonal antibody. 

27. The method of claim 26 wherein the binding agent is a 
polyclonal antibody. 

28. A method for monitoring the progression of breast cancer in a patient, 

comprising: 

(a) contacting a biological sample from a patient with a binding agent that 
is capable of binding to a polypeptide, said polypeptide comprising ari immunogenic portion 
of a breast protein, wherein said protein comprises an amino acid sequence encoded by a 
polynucleotide molecule comprising a sequence selected from the group consisting of 
nucleotide sequences recited in SEQ ID NOS: 1-94, complements of said nucleotide 
sequences and sequences that hybridize to a sequence provided in SEQ ID NO: 1-94 under 
moderately stringent conditions; 

(b) determining in the sample an amount of a protein or polypeptide that 
binds to the binding agent; 

(c) repeating steps (a) and (b); and 

(d) comparing the amount of polypeptide detected in steps (b) and (c) to 
monitor the progression of breast cancer in the patient. 
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29. A monoclonal antibody that binds to a polypeptide comprising an 
immunogenic portion of a breast protein or a variant of said protein that differs only in 
conservative substitutions and/or modifications, wherein said protein comprises an amino 
acid sequence encoded by a polynucleotide molecule comprising a sequence selected from 
the group consisting of: (a) nucleotide sequences recited in SEQ ID NOS: 3, 10, 17, 24, 45- 
52, 55-67, 72, 73, and 89-94: (b) complements of said nucleotide sequences; and (c) 
sequences that hybridize to a sequence of (a) or (b) under moderately stringent conditions. 

30. A monoclonal antibody according to claim 29, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

31. The monoclonal antibody of claim 30 wherein the monoclonal 
antibody is conjugated to a therapeutic agent. 

32. A method for detecting breast cancer in a patient conlprising: 

(a) contacting a biological sample from a patient with at least two 
oligonucleotide primers in a polymerase chain reaction, wherein at least one of the 
oligonucleotides is specific for a polynucleotide molecule encoding a polypeptide comprising 
an immunogenic portion of a breast protein, said protein comprising an amino acid sequence 
encoded by a polynucleotide molecule comprising a sequence selected from the group 
consisting of nucleotide sequences recited in SEQ ID NO: 1-94, complements of said 
nucleotide sequences and sequences that hybridize to a sequence of SEQ ID NO: 1-94 under 
moderately stringent conditions; and 

(b) detecting in the sample a polynucleotide sequence that amplifies in the 
presence of the oligonucleotide primers, thereby detecting breast cancer. 

33. The method of claim 32, wherein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides of a polynucleotide molecule 
comprising a sequence selected from SEQ ID NOS: 1 -94. 
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34. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies of claim 29; and 

(b) a detection reagent. 

35. A diagnostic kit comprising: 

. (a) one or more monoclonal antibodies that bind to a polypeptide encoded 
by a polynucleotide molecule comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOS: 1, 2, 4-9, 1 1-16, 18-23, 25-44, 53, 54, 68-71, and 74-88, 
complements of said sequences and sequences that hybridize to a sequence of SEQ ID NO: 1, 
2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, or 74-88 under moderately stringent conditions; 
and 

(b) a detection reagent. 

^ 36. The kit of claims 34 or 35 wherein the monoclonal antibodies are 
immobilized on a solid support. 

37. The kit of claim 36 wherein the solid support comprises nitrocellulose, 
latex or a plastic material. 

38. The kit of claims 34 or 35 wherein the detection reagent comprises a 
reporter group conjugated to a binding agent. 

39. The kit of claim 38 wherein the binding agent is selected from the 
group consisting of anti-immunoglobulins, Protein G, Protein A and lectins. 

40. The kit of claim 38 wherein the reporter group is selected from the 
group consisting of radioisotopes, fluorescent groups, luminescent groups, enzymes, biotin 
and dye particles. 

41. A diagnostic kit comprising at least two oligonucleotide primers, at 
least one of the oligonucleotide primers being specific for a polynucleotide molecule 
encoding a polypeptide comprising an immunogenic portion of a breast protein, said protein 



.9933869A2_L> 



wo 99/33869 



PCT/US98/27416 



39 

comprising an amino acid sequence encoded by a polynucleotide molecule comprising a 
sequence selected from the group consisting of nucleotide sequences recited in SEQ ID NOS: 
1-94, complements of said nucleotide sequences and sequences that hybridize to a sequence 
of SEQ ID NO: 1-94 under moderately stringent conditions. 

42. A diagnostic kit of claim 41 wherein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides of a polynucleotide molecule 
comprising a sequence selected from SEQ ID NOS: 1-94. 

43. A method for detecting breast cancer in a patient, comprising: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide probe specific 
for a polynucleotide molecule encoding a polypeptide comprising an immunogenic portion of 
a breast protein, said protein comprising an amino acid sequence encoded by a polynucleotide 
molecule comprising a sequence selected from the group consisting of nucleotide sequences 
recited in SEQ ID NOS: 1-94, complements of said nucleotide sequences and sequences that 
hybridize to a sequence of SEQ ID NO: 1-94 under moderately stringent conditions; and 

(c) detecting in the sample a polynucleotide sequence that hybridizes to 
the oligonucleotide probe, thereby detecting breast cancer in the patient. 

44. The method of claim 43 wherein the oligonucleotide probe comprises 
at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a sequence 
selected from the group consisting of SEQ ID NOS: 1-94. 

45. A diagnostic kit comprising an oligonucleotide probe specific for a 
polynucleotide molecule encoding a polypeptide comprising an immunogenic portion of a 
breast protein, said protein comprising an amino acid sequence encoded by a polynucleotide 
molecule comprising a sequence selected from the group consisting of nucleotide sequences 
recited in SEQ ID NOS: 1-94, complements of said nucleotide sequences, and sequences that 
hybridize to a sequence of SEQ ID NO: 1-94 under moderately stringent conditions. 
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46. The diagnostic kit of claim 45, wherein the oligonucleotide probe 
comprises at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a 
sequence selected from the group consisting of SEQ ID NOS: 1-94. 

47. Peripheral blood cells from a patient incubated in the presence of at 
least one polypeptide of claim 1 , such that T cells proliferate, for use in the manufacture of a 
medicament for treating breast cancer in a patient. 

48. The blood cells of claim 47 wherein the T cells is repeated one or 

more times. 

49. A composition for the treatment of breast cancer in a patient, 
comprising T cells proliferated in the presence of a polypeptide of claim 1 , in combination 
with a pharmaceutically acceptable carrier. 

50. An antigen presenting cells incubated in the presence of at least one 
polypeptide of claim 1, for use in the manufacture of a medicament for treating breast cancer 
in a patient. 

51. The cells of claim 50 wherein the antigen presenting cells are selected 
from the group consisting of dendritic and macrophage cells. 

52. A composition for the treatment of breast cancer in a patient, 
comprising antigen presenting cells incubated in the presence of a polypeptide of claim 1, in 
combination with a pharmaceutically acceptable carrier. 
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SEQUENCE LISTING 



<110> Corixa Corporation 



<130> 210121. 446PC 

<140> PCX 

<141> 1998-12-22 

<160> 94 

<170> Patentin Ver. 2.0 

<210> 1 

<211> 402 

<212> DNA 

<213> Homo sapiens 



<400> 1 

tttttttttt 

aaaacaaatn 

ctttcaatat 

gttaacttgc 

agttnaatta 

tcnttgccna 

ngcagnccnc 



tttttaggag 
atacgaaatn 
tttacaaaat 
tgcaatnaat 
ctaattttaa 
tacatacnta 
aaaaatctta 



aactgaatca 
ngtcataaga 
gctcacgcag 
gcaactttaa 
tgatnttnct 
ttttctttac 
ccggttaatt 



aacagatttt 
aatgctttct 
caaatatgaa 
canacataca 
caagatnttt 
ttttttttta 
acacggggtt 



attcaacttt 
tataccacta 
aagctncaac 
aatttcttct 
attcatatac 
cnatnggcca 

gt 



ttagatgagg 60 
tctcaaacca 120 
acttcccttt 180 
gtatcttaaa 240 
ttttaatgac 300 
acagctttca 360 
402 



<210> 2 

<211> 424 

<212> DNA 

<213> Homo sapiens 



<400> 2 

tttttttttt 

ttcgttggca 

ncatttccaa 

aattctttgc 

gcaaacncag 

gganccgccc 

ttaaacttgg 

gggc 



ttttttaaag 
tcttctctgt 
ctnagcccac 
tgganacaaa 
attgaaggga 
agggncattt 
tgcaaaaggc 



gtacacattt 
gatgggcagc 
gctttcaacc 
gaactacatt 
anaagganac 
cctcaccatg 
gcaattggtg 



ctttttcatt 
ttgctaaaat 
nngccnaaca 
cctttgtaaa 
ttntggggac 
cttaatcttg 
nanggaaccc 



ctgtttnatg 
tanactcagg 
aagaaaatca 
tnatgctttg 
ggaaacaact 
cnctcacttg 
cacaccttcc 



cagcaaataa 60 
ccccttagct 120 
gttngggtta 180 
tttgctctgt 240 
ngnagaagca 3 00 
cngggcacca 36 0 
ttaaaaagca 420 
424 



<210> 3 

<211> 421 

<212> DNA 

<213> Homo sapiens 

<400> 3 

aatna^^^" ""^^^^^^ tttaaaaaag cctttttcat acttcaatta caccanactt 60 

^^^n^^^^^^ tgagtaaatc ngacattatt atttnaaaat ttgcatattt aaaatttgna 120 

ctJto^on^^ ccagactgtt tgcanaatga agggaggatc actcaagngc tgatctcnca 180 

ctntctgcag tctnctgtcc tgtgcccggn ctaatggatc gacactanat ggacagntcn 24 0 
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cagatcttcc gttcttntcc cttccccaat 
tttggggaca tgntaatttt gcnatcctta 
gggtggttaa tgttcgncng gcttnttgac 
c 



ttcncaccnc tccccttctt ncccggatcn 300 
aaccctgccc gccangggtc ccnanctcag 360 
cncctgcgcc ctttnantcc naaccccaag 420 

421 



<210> 4 
<211> 423 
<212> DNA 

<213> Homo sapiens 



<400> 4 

tttttttatt tttttttcta tttntnntat 
tgtgtatgcg tangtacnta tgtntgcata 
aaaatctcaa natngtantt ggttnatggg 
atggacacng tgaaatgtag ccgctnatca 
cctccnaata aaaatnccng gccctactgg 
acatgcanac nagttaaacc tgtgnactgg 
ttcnccccan ggacantcng aattttttta 
tgt 

<210> 5 
<211> 355 
<212> DNA 

<213> Homo sapiens 



ttnntgnggt tcctgtgtgt aattagnang 60 
tttaacctgt tncctttcca tttttaaaat 120 
agtaaanaga gactatngat naattttaac 18 0 
ntttaaaact tcattttgaa ggccttttnc 240 
gttaagcaac attgcatntc taaagaaacc 300 
tcangcaaac cnanntggaa nanaagggnn 36 0 
acaaattacn atnccccccc ngggggagcc 420 

423 



<400> 5 

acgaccacct natttcgtat ctttcaactc 

tccaggaaga caggtctcaa cttagggatc 

gcaacctggc acttcaagga agtgcaccga 

aggtggccaa ctgatcactg taggagctga 

agtgaccaan acnaccattn aggatcaccc 

ccaaacggct ggccaatggg ggggtttaat 



ttttcgaccg gacctcttat tcggaagcgt 60 
agatcacgtt atcaacgctc tgggatcgct 120 
tnacgtctag accggccaac acagatctag 180 
ctggcaanan tcaaccgggc cccaaccnag 24 0 
acaggcactc ctcgtcctag ggccaaccna 300 
atttggttna aaaattgatt ttaaa 355 



<210> 6 

<211> 423 

<212> DNA 

<213> Homo sapiens 



<400> 6 

tttttttttt tttttggaca ggaagtaaaa 

acattggaag ccctcatgan tgcagggccc 

tgtacttaac cccacagccn tccgggatna 

catcagcatt aaacttggta aanccccact 

naaacttgaa cttggccctg cgcagggcct 

tgcgnaagga cntaatnact tggccnatgt 

caaaggcacc tcgcaagcct ntttggancc 
ttt 



tttattggtn antattaana ggggggcagc 6 0 
gccacttgtc cagagggcca cnattgggga 12 0 
gccgcttttc agccaccatn tcttcaaatt 180 
tctttaagat ntgnatcttc tggcggccag 24 0 
caatcacatg ctccttgttc tgcagcttgg 300 
gaaccctggc cacantgccc tggggctttc 36 0 
tgnccgcccc ngcacaggga caacatcttg 42 0 

423 



<210> 7 

<:211> 410 

<212> DNA 

<213> Homo sapiens 



<400> 7 

ttcgcactgg ctaaaacaaa ccgccttgca aagttngaaa aatttatcaa tggaccaaat 60 
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aatgctcata 
nanttgttgt 
gtaaatatca 
gaggtctgct 
catattgttg 
tttttnnaaa 



tccnacaagt tggtgaccgt tnttatnata aaaaaatgta tnatgctcct 120 

acaataatgt tccaatttng gacnttcggc atctaccctg gttcacctgg 180 

ggcagctttt gatggggcta ggaaagctaa cagtactcga acatgggaaa 24 0 

tcgccngtgt anatgggaaa naattccgtc ttgctcngat ttgtggactt 300 

tacatgcaga tgaatnngaa gaacttgtca actactatca ggatcgtggc 360 

agctnatcac catgttggaa gcggcactng gacttgagcg 410 



<210> 8 

<211> 274 

<212> DNA 

<213> Homo sapiens 

<400> 8 

tttttttttt tttttaggtc atacatattt tttattataa canatatntg tatatacata 60 

taatatatgt gtatatatcc acgtgtgtgt gtgtgtatca aaaacaacan aantttagtg 120 

atctatatct ntngctcaca tatgcatggg agataccagt aaaaaataag tnaatctcca 18 0 

taatatgttt taaaactcan anaaatcnga gagactnaaa gaaaacgttn atcannatga 24 0 
ttgtngataa tcttgaanaa tnacnaaaac atat 274 

<210> 9 

<211> 322 

<212> DNA 

<213> Homo sapiens 



<400> 9 

tttttttttt 

ataagagtga 

gtttttcacc 

ttaaaaaagt 

atttngttca 

cctnnanata 



ttttgtgcct 
taagtgtgaa 
attaantgta 
tttcctctnt 
gtacttttnc 
tgatggatna 



tattgcaccg 
aatccttgcc 
ataatggctn 
ccctngttat 
ctgcacctat 
ca 



gcnanaactt 
ttctctttaa 
tatgtatttt 
ctaanagttt 
agatatgatn 



<210> 10 

<211> 425 

<212> DNA 

<213> Homo sapiens 



<400> 10 

tttttttttt 

ggtcacaaaa 

ttctatattt 

cttctcaggc 

ggccaaggat 

actatctnaa 

caaccggaag 

caggn 



ctagcactat 
tcttaatgna 
tatnnatggt 
tnatcaaaaa 
ctgttatttt 



attaaactca 60 
naggcatctg 120 
cttnatggag 180 
tgggtataat 240 
ttcttcttng 300 
322 



tttttattct 
taagtacaga 
acaatatctc 
agcagcgcct 
caggaaacag 
ctgctagttt 
catcctagat 



gcagccatta 
atatnacaca 
tggaattcca 
tctggcaacc 
aanaatggaa 
tcaggcttta 
ggtacactct 



aatgctgaac 
cgccctgccc 
ccttcccttc 
ataagaacca 
gnagcccccn 
aatcatgtaa 
ctccaggtgc 



actagatnct 
ataaaaagca 
taatttgact 
acntgnggac 
tgacnctatt 
natacgtgtc 
caggaaaaga 



tatttgtgga 60 
cagctcccag 120 
aatatttctg 180 
taggtcggtg 24 0 
aanctntnaa 300 
cttnttgctg 360 
tcccaaatng 420 
425 



<210> 11 

<211> 424 

<212> DNA 

<213> Homo sapiens 

<400> 11 

ttttnttant ttttttancc nctnntccnn tntgttgnag ggggtaccaa atttctttat 60 
ttaaaggaat ggtacaaatc aaaaaactta atttaatttt tnggtacaac ttatagaaaa 120 
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ggttaaggaa accccaacat gcatgcactg 
ntggggaaat tagcccaang ctnagctttc 
aaaaaaattt nccgccnagc cnaatccggg. 
gtcccccnat tctttaangg cttncacctn 
ccacngatgg gggtcatttt tntcnnttag 
ccan 



ccttggtaac cagggnattc ccccncggct 180 
attatcactn tcccccaggg tntgcttttc 240 
cnctcccatc tggcgcaant tggtcacttg 300 
ctcattcggg tnatgtgtct caattaaatc 360 
ccagtttgtg nagttccgtt attganaaaa 420 

424 



<210> 12 

<211> 426 

<212> DNA 

<213> Homo sapiens 



<400> 12 

tttttttttt ttttncttaa aagcttttat ctcctgctta cattacccat ctgttcttgc 60 

atgttgtctg ctttttccac tagagccctt aacaacttaa tcatggttat tttaagggct 120 

ctaataattc cnaaactggt atcataaata agtctcgttc tnatgcttgt tttctctcta 180 

tcacactgtg ttngttgctt tttnacatgc tttgtaattt ttggctgaaa gctgaaaaat 24 0 

nacatacctg gttntacaac ctgaggtaan cagccttnta gtgtgaggtt ttatatntta 300 

ctggctaaga gctnggcnct gttnantant tgttgtanct ntatatgcca naggctttna 360 

tttccnctng tgtccttgct tnagtacccc attnttttag gggttcccta naaactctat 420 

ctnaat 426 

<210> 13 

<211> 419 

<212> DNA 

<213> Homo sapiens 

<400> 13 

tttttttttt tttttnagat agactctcac tctttcgccc aggctggagt gcagtggcgc 60 

aatcaaggct cactgcaacc tctgccttat aaagcatttn ctaaaggtac aagctaaatt 120 

ttaaaaatat ctctncacaa ctaatgtata acaaaaatta gttctacctc ataaacncnt 18 0 

ggctcagccc tcgnaacaca tttccctgtt ctcaactgat gaacactcca naaacagaac 24 0 

anatntaagc ttttccaggc ccagaaaagc tcgcgagggg atttgctntg tgtgtgacac 300 

acttgccacc ctgtggcagc acagctccac acntgctttg ggccgcattt gcaagttctc 36 0 

tgtaancccc ctgnaagacc cggatcagct gggtngaaat tgcangcnct cttttggca 419 

<210> 14 

<211> 400 

<212> DNA 

<213> Homo sapiens 



<400> 14 

aanccattgc caagggtatc cggaggattg 
ccctcaggaa agcaaagagc ttgaaaaatg 
tcanactgct ccaacaagga tntgcanagg 
actgcagtcn tcccccantg gcagaaggat 
gaaggtcgtg gatnacttgg accgagcctc 
aagacaaagc anttcatcga cgccaacccc 
gcggcgcccc cgcccagggc cttaataanc 



tggctgtcac aggtnccgag gcccanaagg 60 

tctctctgtc atggaagccn aagtgaaggc 120 

gagatcgcta accttggaga ggccctggcc 180 

gaattgcggg agactctcan atcccttang 240 

nnaagccaat ntccagaaca agtgttggag 3 00 

naccggcctc tnttctcctg ganattgana 360 

cntgaagctn 4 00 



<210> 15 

<211> 395 

<212> DNA 

<213> Homo sapiens 
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<400> 15 

tgctttgctg 

agcgagatta 

tgatgaagca 

tactgaagca 

catgaaagcn 

ctntgtattt 

aagttcaaga 



cgtccaggaa 
gacntactga 
attgtgaagc 
ggtatgttcg 
gtcagaanag 
actgttggan 
aaattatgtt 



gattagatng 
anatccatgc 
tatcggatgg 
caattcgtgc 
tggctnattc 
ttttgatgct 
agttttggcc 



aanaatacat 
aggtcccatt 
ctttnatgga 
tgatcatgat 
tnaaagctgg 
gcatgacaga 
attat 



attgatttgc 
acaaagcatg 
gcagatctga 
tttgtagtac 
agtctaaatt 
ttttgcttan 



caaatgaaca 60 
gtgaaataga 120 
gaaatgtttg 180 
aggaagactt 240 
ggacnacnac 300 
tgtaaaaatn 360 
395 



<210> 16 
<211> 404 
<212> DNA 

<213> Homo sapiens 



<400> 16 

ccaccactaa 

nagggcacac 

ttcacctctt 

ctnccatgta 

gtaggagctg 

tgcccctgct 

ntctcanggc 



aatcctggct 
cctttaagtn 
gggcttgagt 
cgtgctgcca 
caggcnagaa 
tctcaaggag 
ttccctnccg 



gagccctacn 
aggtgacagg 
acctggttct 
atngtcttga 
ngaagcggtt 
gggtgtgggg 
gctnancgca 



agtacctgtg 
tcacctttaa 
cgtgccctga 
tcttctccac 
aacagcgcca 
ttcctccacc 
ngacttaagc 



cccctccccc 
gtgaggacag 
ggcgacnctn 
gccnctnaac 
ctccatagcc 
atcgccgccc 
atgg 



aggacgagat 6 0 
tcagctnaat 120 
agccctgcag 18 0 
ttgggcttca 24 0 
gcagccnggc 300 
ttgcaaacac 360 
404 



<210> 17 

<211> 360 

<212> DNA 

<213> Homo sapiens 



<400> 17 

ggccagaagc 

gctggcagca 

acctcatacc 

cggggagaac 

ctctncaatt 

cccgcaatta 



tttccacaaa 
gctgctatct 
tnaccgcttc 
catacttatt 
actntctccn 
cctggcaaaa 



ccagtgaagg 
ngatngacng 
cctttnaatg 
cccctnttcc 
ctgctttntt 
ctcgcgaccc 



tggcagcaaa 
cagaaaccaa 
gccttcggtg 
cggcctacca 
ctnancacta 
ttcgggcagc 



gaaagcctct 
ccactaattc 
tgtgcgcaca 
cctctnctcc 
ctgctngtnt 
gctaaanaat 



tagacnagga 60 
agcaaacaca 12 0 
tgggcacgtg 180 
cccttctctt 240 
cnanagccng 3 00 
gcacatttac 360 



<210> 18 
<211> 316 
<212> DNA 

<213> Homo sapiens 



<400> 18 

atacatatac 

tgtgtgtata 

agccagtttt 

taatgttnag 

cattcatgtt 

tcgatnttat 



acatatatga ttttagatag agccatatac ctngaagtag tanatttgtt 60 

tgtatgtgtc tactcatttt aaataaactt gtgatagaga tgtaattntg 120 

tcatttgctt aaatnactca ccaagtaact aattaagttn tctttactct 180 

tagtgagatt ctgttgaagg tgatattaaa aaccattcta tattaattaa 240 

gttttttaaa agcttatttg aaatcnaatt atgattattt ttcataccag 300 
gtangt 316 



<210> 19 
<2ll> 350 
<212> DNA 

<213> Homo sapiens 



<400> 19 
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aagggatgca nataatgctg tgtatgagct 
tactattgaa catgctnggg ctcggtcacg 
ccgttttagt agtcgcagac ctcgaaatga 
anatcgtctt atagttgaga atttatcctc 
agttttgagt tnttttgatg tggcttttta 
ttttattaaa agtagttttn aattaatgga 



tgatggaaaa gaactctgta gtgaaagggt 60 
aggtggaaga ggtagaggac gatactctga 120 
tagacgaaat gctccacctg taagaacaga 180 
aagagtcagc tggcaggttt gttganatac 24 0 
aaaaagttat gggttactna tgttatattg 300 
tntgatggaa ttgttgtttt 350 



<210> 20 

<211> 367 

<212> DNA 

<213> Homo sapiens 



<400> 20 

gntnnncnca agatcctnct ntcccccngg 
ntaanatcnn gccgcncccg aagtctcnct 
ncaattntga cctnnggcga anaatggcng 
tagnatctga ccactangac ccnctatcct 
ccaattagtg catgntanag cntcctggcc 
cgcccctacc angncatccn catctactag 
ccccnct 



gcngccccnc cnccngtnat naccggtttn 6 0 
nntgccgaga tgncccttat ncncnnatgn 120 
nngtgtatca gtntccnctc tgnggnctct 180 
ctcaaaccct gtanncngcc ctaatttgtg 24 0 
cagatggcnt ccatatcctg gtncggcttc 300 
agcttatccg ctncntgngg cgcaccggnt 36 0 

367 



<210> 21 

<211> 366 

<212> DNA 

<213> Homo sapiens 

<400> 21 

cccaacacaa tggtctaagt anaactgtat 
ctacaatggg aaaatccata cataagtcag 
tcctttatct tctgaagaaa gtacacacct 
tccaggcatc agttggatga ttcatcatgg 
tgtcatacga aaacaccagt tctgcccnna 
tgggaatatt cactctagac cagaaacagc 
aatttn 



tgctctgtag tatagttcca cattggcaac 60 
ttacttcctn atgagctttc tccttctgaa 120 
tggtnatgat atctttgaat tgcccttctt 180 
taattatggc attatcatat tcttcatact 240 
gatgagcttg ttctgcagct cttagcacct 300 
tcccggtgct ccctcatttt ctgaggctta 360 

366 



<210> 22 

<211> 315 

<212> DNA 

<213> Homo sapiens 



<400> 22 

acttaatgca atctctggag gataatttgg 
gaagaaatna ctgggtnata tttcaatatt 
caatatattt gtnaaaactg agatacangt 
cnaatnnatt ctatttnaat gttgtttcag 
atacctnaat atcacacttn tncttnnaat 
gcatataatg tcgac 



atcaagaaat aaagaanaaa tgaattagga 60 
ttagaacttt aanaatgttg actatgattt 120 
ttgacctata tctgcatttt gataattaaa 180 
agtcacagca cagactgaaa ctttttttga 240 
gatgttgaag acaatgatga catgccttna 300 

315 



<210> 23 

<2ll> 202 

<212> DNA 

<213> Homo sapiens 



<400> 23 

actaatccag tgtggtgnaa ttccattgtg ttgggcaact caggatatta aatttatnat 60 
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ttaaaaattc ccaagagaaa naaactccag 
aaatgttnca nnaaganatg acgctgattc 
acacccaccg nttcatttta tg 

<210> 24 

<211> 365 

<212> DNA 

<213> Homo sapiens 



gccctgattg tttcactggg gaattttacc 120 
tgtnaaatct ttttcagaag atagaggaga 180 

202 



<400> 24 

ggatttcttg cccttttctc cctttttaag 
ctttctgcca tacaaccgct accacatctg 
ggatctcgga accnagtgtt nacttcattt 
tggtctgtct gtatteacca tggggcctgt 
gtgccctcca ttgtggccac atgggagacc 
acggcactgc atctnacccg acntgggatt 
gatca 



tatcaatgta tgaaatccac ctgtaccacc 60 
gctcctagaa cctgttttgc tttcatagat 120 
ttaaacccca ttttagcaga tngtttgctn 180 
acacaccacg tgtggttata gtcaaacaca 240 
catnacccna tactgcatcc tgggctgatn 3 00 
gaacccgggg tgggcagcng aattgaacag 360 

365 



<210> 25 

<211> 359 

<212> DNA 

<213> Homo sapiens 



<400> 25 

gtttcctgct tcaacagtgc ttggacggaa cccggcgctc gttccccacc ccggccggcc 60 
gcccatagcc agccctccgt cacctcttca ccgcaccctc ggactgcccc aaggcccccg 120 
ccgccnctcc ngcgccncgc agccaccgcc gccnccncca cctctccttn gtcccgccnt 18 0 
nacaacgcgt ccacctcgca ngttcgccng aactaccacc nggactcata ngccgccctc 24 0 
aaccgcccga tcaacctgga gctctncccc ccgacnttaa cctttccntg tcttacttac 300 
nttaaccgcc gnttattttg cttnaaaaga acttttcccc aatactttct ttcaccnnt 359 

<210> 26 

<211> 400 

<212> DNA 

<213> Homo sapiens 



<400> 26 

agtgaaacag tatatgtgaa aaggagtttg 
ttataatttc caataggata ctcatcagtt 
ggtttctggt ttcagatttg aactctcaag 
acnacnaana aatctnaacn aacngaanac 
ttcagccacg aggaaaacta tcnccctnaa 
aatcacagtg aatcatagcc caagatcagc 
acttacaggg accacttcac agtnngtnga 

<210> 27 

<211> 366 

<212> DNA 

<213> Homo sapiens 



tgannagcta cataaaaata ttagatatct 60 
ttgaataana gacatattct agagaaacca 120 
agcttggaag ttatcactcc catcctcacg 180 
caatgacttt tcttagatct gtcaaagaac 24 0 
tactggggac tggaaagaga gggtacagag 3 00 
ttgcccggag ctnaagctng tacgatnatt 36 0 
tnaantgccn 4 00 



<400> 27 

gaatttctta gaaactgaag tttactctgt tccaagatat atcttcactg tcttaatcaa 60 
agggcgctng aatcatagca aatattctca tctttcaact aactttaagt agttntcctg 120 
gaattttaca ttttccagaa aacactcctt tctgtatctg tgaaagaaag tgtgcctcag 180 
gctgtagact gggctgcact ggacacctgc gggggactct ggctnagtgn ggacatggtc 24 0 
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agtattgatt ttcctcanac tcagcctgtg tagctntgaa agcatggaac agattacact 300 
gcagttnacg tcatcccaca catcttggac tccnagaccc ggggaggtca catagtccgt 360 
tatgna - 366 

<210> 28 

<211> 402 

<212> DNA 

<213> Homo sapiens 

<400> 28 

agtgggagcc tcctccttcc ccactcagtt ctttacatcc ccgaggcgca gctgggcnaa 60 
ggaagtggcc agctgcagcg cctcctgcag gcagccaacg ttcttgcctg tggcctgtgc 12 0 
agacacatcc ttgccaccac ctttaccgtc catcangcct gacacctgct gcacccactc 180 
gctngctttt aagccccgat nggctgcatt ctgggggact tgacacaggc ncgtgatctt 24 0 
gccagcctca ttgtccaccg tgaagagcat ggcaaaaagt ctgaggggag tgcatcttga 300 
anagcttcaa ggcttcattc agggccttng ctnaggcgcc nctctccatc tccnggaata 36 0 
acnagaggct ggtnngggtn actntcaata aactgcttcg tc 4 02 

<210> 29 

<211> 175 

<212> DNA 

<213> Homo sapiens 

<400> 29 

cggacgggca tgaccggtcc ggtcagctgg gtggccagtt tcagttcttc agcagaactg 60 

tctcccttct tgggggccga gggcttcctg gggaagagga tgagtttgga gcggtactcc 12 0 

ttcagccgct gcacgttggt ctgcagggac tccgtggact tgttccgcct cctcg 175 

<210> 30 

<211> 360 

<212> DNA 

<213> Homo sapiens 

<400> 30 

ttgtatttct tatgatctct gatgggttct tctcgaaaat gccaagtgga agactttgtg 6 0 
gcatgctcca gatttaaatc cagctgaggc tccctttgtt ttcagttcca tgtaacaatc 120 
tggaaggaaa cttcacggac aggaagactg ctggagaaga gaagcgtgtt agcccatttg 180 
aggtctgggg aatcatgtaa agggtaccca gacctcactt ttagttattt acatcaatga 24 0 
gttctttcag ggaaccaaac ccagaattcg gtgcaaaagc caaacatctt ggtgggattt 300 
gataaatgcc ttgggacctg gagtgctggg cttgtgcaca ggaagagcac cagccgctga 36 0 

<210> 31 

<211> 380 

<212> DNA 

<213> Homo sapiens 

<400> 31 

acgctctaag cctgtccacg agctcaatag ggaagcctgt gatgactaca gactttgcga 60 

acgctacgcc atggtttatg gatacaatgc tgcctataan cgctacttca ggaagcgccg 120 

agggaccnaa tgagactgag ggaagaaaaa aaatctcttt ttttctggag gctggcacct 180 

gattttgtat ccccctgtnn cagcattncn gaaatacata ggcttatata caatgcttct 240 

ttcctgtata ttctcttgtc tggctgcacc ccttnttccc gcccccagat tgataagtaa 300 

tgaaagtgca ctgcagtnag ggtcaangga gactcancat atgtgattgt tccntnataa 360 

acttctggtg tgatactttc 380 
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<210> 32 

<211> 440 

<212> DNA 

<213> Homo sapiens 



<400> 32 

gtgtatggga 

ccaccccctg 

cctcacttct 

ccctgaagtc 

caggggagac 

gtganaagca 

cttggagaac 

caccctttgg 



gcccctgact 
cacctccacc 
ggggtggatg 
ttacggtcca 
cgtntcanta 
cgtggcggct 
ctcagtcccn 
gggttggggt 



cctcacgtgc 
tgccccagcc 
atgtgacctt 
acatcaggac 
gggatgtgtg 
tctgggggcc 
gtagccccct 



ctgatctgtg 
cctgcctctg 
cctnggggga 
caagtcccat 
cctggctgtg 
atgtttgggg 
gccctggcac 



ccct tggtcc 
ccccaagtgg 
ctgcggaagg 
ggacatgctg 
tacgtgggtg 
aaggaagtgt 
agcngcatnc 



caggtcaggc 60 
ggccagctgc 120 
gacaagggtt 180 
acagggtccc 24 0 
tgcagtgcac 300 
gcccnccacc 360 
acttcaaggg 420 
440 



<210> 33 

<211> 345 

<212> DNA 

<213> Homo sapiens 



<400> 33 

tattttaaca 

gattatttgg 

ctcattttgt 

gtgcagaagg 

aggctggaag 

tggtctgctg 



atgtttatta 
agtgggtccc 
gacttaggtg 
aggacctagg 
acaaaagtct 
ggaatgttgt 



<210> 34 

<211> 440 

<212> DNA 

<213> Homo sapiens 



<400> 34 

tgtaattttt 

ccataagcag 

tgtttctggt 

ccgttttgca 

gttacaactc 

aaaggcataa 

agggcttana 

ataaggtttt 



ttcatttatc 
aacctagggc 
ggggcatggt 
cacacacata 
cccaataaag 
tgttggggtg 



cctctataga 
ctggactctg 
tcagtcagaa 
tggtggccac 
gcacttttac 
gggaagantt 



accaccaccc 
aaatctaact 
ctggtgtctc 
acccaggagg 
ctcaaagang 
atttc 



acaccgagga 60 
ccccacttcc 120 
ctattggatc 180 
gttgattggc 24 0 

gggtgggagt 3 00 

345 



ttattggaaa 
attttaagtg 
acaggaccag 
atagtgcaac 
cacacttcaa 
cccagatgtt 
ttttcccaga 
aggacaatga 



acaaatatac 
gctaaacaaa 
cagtacaaaa 
ttttaagtac 
caacaacatg 
ccctcatttg 
gtaagccnca 



aacttggaat 
gtttaaaaag 
aaatagtgta 
atattgttga 
ctgacagttc 
accaactcca 
tgcaacatgt 



ggattttgag 
caagtaacaa 
cgagtacctg 
ctgtccatag 
ctaaagaaaa 
tctnagttta 
tacttgatca 



gcaaattgtg 60 
taaaagaaaa 120 
gataatacac 180 
tccacgcaga 240 
ctactttaaa 300 
gatgtgcaga 36 0 
attttctaaa 420 
440 



<210> 35 

<211> 540 

<212> DNA 

<213> Homo sapiens 



<400> 35 

atagatggaa 

gtactaacaa 

ctgtggcttt 

cttttaaaaa 

ataaaacaat 

gtataaggga 



tttattaagc 
aaactctagc 
taaaatttgg 
atgctttagg 
cacaatttaa 
aaaggtggta 



ttttcacatg 
aatcaagaat 
ttttcataag 
tcactccaag 
taaataacaa 
gtgttganta 



tgatagcaca 
ggcagcatgt 
ataatttata 
cttggcagtt 
atacaacatt 
agcagttatt 



tagttttaat 
tattttataa 
ctgaagtaaa 
aacatttggc 
gtaggccata 
agaatagaat 



tgcatccaaa 60 
caatcaacac 120 
tctagccatg 180 
ataaacaata 240 
atcatataca 300 
accttggcct 360 
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ctatgcaaat atgtctagac actttgattc actcagccct gacattcagt tttcaaagtt 420 
aggaaacagg ttctacagta tcattttaca gtttccaaca cattgaaaac aagtagaaaa 480 
tgatganttg atttttatta atgcattaca tcctcaagan ttatcaccaa cccctcaggt 540 

<210> 36 
<211> 555 
<212> DNA 

<213> Homo sapiens 
<400> 36 

cttcgtgtgc ttgaaaattg gagcctgccc ctcggcccat aagcccttgt tgggaactga 60 

gaagtgtata tggggcccaa nctactggtg ccagaacaca gagacagcag cccantgcaa 120 

tgctgtcgag cattgcaaac gccatgtgtg gaactaggag gaggaatatt ccatcttggc 180 

agaaaccaca gcattggttt ttttctactt gtgtgtctgg gggaatgaac gcacagatct 24 0 

gtttgacttt gttataaaaa tagggctccc ccacctcccc cntttctgtg tnctttattg 300 

tagcantgct gtctgcaagg gagcccctan cccctggcag acananctgc ttcagtgccc 360 

ctttcctctc tgctaaatgg atgttgatgc actggaggtc ttttancctg cccttgcatg 420 

gcncctgctg gaggaagana aaactctgct ggcatgaccc acagtttctt gactggangc 480 

cntcaaccct cttggttgaa gccttgttct gaccctgaca tntgcttggg cnctgggtng 540 
gnctgggctt ctnaa 555 

<210> 37 

<211> 280 

<212> DNA 

<213> Homo sapiens 



<400> 37 

ccaccgacta taagaactat gccctcgtgt 
acgtggattt tgcttggatc ttggcaagaa 
ctctaaaaaa tatcctgact tctaataaca 
aggtgaactg ccccnagctc tcgtaaccag 
ttncttctgc ttcgctttcc cctaccccac 



attcctgtac ctgcatcatc caactttttc 60 
accctaatct ccctccagaa acagtggact 120 
ttgatntcaa gaaaatgacg gtcacagacc 18 0 
gttctacagg gaggctgcac ccactccatg 240 
cccccgccat 280 



<210> 38 

<211> 303 

<212> DNA 

<213> Homo sapiens 



<400> 38 

catcgagctg gttgtcttct tgcctgccct 
tatcaaggga aaggcaagac tgggacgtct 
cttcacacag gtgaactcgg aagacaaagg 
gaccaattac aatgacngat acgatnagat 
tcctaagtct gtggctcgta tcgccnagct 
taa 



gtgtcgtaaa atgggggtcc cttactgcat 60 
agtccacagg aagacctgca ccactgtcgc 120 
cgctttggct nagctggtgn aagctatcag 180 
ccgccntcac tggggtagca atgtcctggg 24 0 
cgaanaggcn aangctaaag aacttgccac 300 

303 



<210> 39 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 39 

gactcagcgg ctggtgctct tcctgtgcac 

atcaaatccc accaagatnt ttggcttttg 

aactcattga tgtaaatnac tnaaagtgag 



aagcccagca ctccaggtcc caaggcattt 60 
caccgaattc tgggtttggt tccctnaaag 120 
gtctgggtac cctttacatg attccccaga 180 
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cctcanatgg gctaacacgc ttctcttctc cagcagtctt cctntccgtg aagttacctt 240 
ccagattgtt acatggaact gaanacaaag ggagcctcag ctngatttaa atctggagca 300 

<210> 40 

<211> 318 

<212> DNA 

<213> Homo sapiens 



<400> 40 

cccaacacaa 

tgggctgttg 

gatggtgaag 

catcattggg 

acgagccatc 

tctgtctcct 



tggctgagga caaatcagtt ctctgtgacc agacatgaga aggttgccaa 6 0 
ggcgaccaag gccttcccgg agtcttcgtc ctctatgage tctcgcccat 120 
ctgacggaga agcacaggtc cttcacccac ttcctgacag gtgtgtgcgc 180 
ggcatgttca cagtggctgg actcatcgat tcgctcatct accactcagc 240 
cagaaaaaaa ttgatctngg gaagacnacg tagtcaccct cggtncttcc 3 00 
ctttctcc 318 



<210> 41 

<211> 302 

<212> DNA 

<213> Homo sapiens 



<400> 41 

acttagatgg 

cttggcctga 

ccatnaccat 

ataatnagtc 

ccaaacnatt 

tc 



ggtccgttca 
atgttcccca 
ctcggtaacc 
cataatgtaa 
accagacatt 



ggggatacca 
tccggacaca 
tactcttact 
atgcctggcc 
cctcttanat 



gcgttcacat 
ggctgcatgt 
ccacaatgtc 
caagacntat 
tgaaaacgga 



ttttcctttt 
ctctgtnagt 
tatnttcact 
ggcctgagtt 
tntctttccc 



aagaaagggt 6 0 
gtcaaagctg 120 
gcagggctct 180 
tatccnaggc 24 0 
ttggcaaaga 300 
302 



<210> 42 

<211> 299 

<212> DNA 

<213> Homo sapiens 



<400> 42 

cttaataagt 

ggtttggaca 

gattacatcc 

tttgtcacac 

ctaccaggtt 



ttaaggccaa ggcccgttcc attcttctag caactgacgt tgccagccga 60 
tacctcatgt aaatgtggtt gtcaactttg acattcctac ccattccaag 120 
atcgagtagg tcgaacagct agagctgggc gctccggaaa ggctattact 180 
agtatgatgt ggaactcttc cagcgcatag aacacttnat tgggaagaaa 240 
ttccaacaca ggatgatgag gttatgatgc tnacggaacg cgtcgctna 299 



<210> 43 

<211> 305 

<212> DNA 

<213> Homo sapiens 



<400> 43 
ccaacaatgt 
ccttcattgg 
ctgccatgtt 

tggagtitcac 

gggatgccac 
aggca 



caagacagcc 
caatagcaca 
ccgccggaag 
cgaggctgag 
cgcagaaana 



gtctgtgaca 
gccntccggg 
gccttcctcc 
agcaacatga 
ggaggaggat 



tcccacctcg 
agctcttcaa 
actggtacac 
acgacctcgt 
ttcggtnagg 



tggcctcaan 
gcgcatctcg 
aggcgagggc 
ctctnagtat 
aggccgaaga 



atggcagtca 6 0 
gagcagttca 120 
atggacaaga 180 
cagcagtacc 240 
aggaggcctg 300 
305 



<210> 44 
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<211> 399 
<212> DNA 
<213> Homo sapiens 



<400> 44 

tttctgtggg ggaaacctga tctcgacnaa 
tggaacagaa cgaaaacnga tnaatctctg 
cagacacagc tccnaattga ttccttcttt 
ttaacgtatt aagagccnga gactaaacag 
agaagcagcn gcccgcgnaa ttngaagcng 
ttctttatta ggccaacgaa aaaccccgaa 
tccngccnna aaaagaaaga agctttcnga 



attagagaat tttgtcagcg gtatttcggc 60 
tttcctgtat taaagcaact cgatncccag 120 
ngattagcac aacagggaga aagaanatgc 18 0 
agctttgaca tgtatgctta ggaaagagaa 24 0 
tttctgttgc cntgganaaa gaatttgagc 300 
ananaggcnt tacnatacct: tngaaaantc 360 
ttcttaacc 399 



<210> 45 

<211> 440 

<212> DNA 

<213> Homo sapiens 



<400> 45 

gcgggagcag aagctaaagc caaagcccaa 
accagtacca ataacagtgc cagtgccagt 
gccagcctga ccgccactct cacatttggg 
agcaccagtg gcagctctgg tgcctgtggt 
ccttggtttc agtggggaca tctggggctt 
taggaaggcc cangttggag aangatgtgn 
ttattccttt ctgtttgctg gangtcaatt 
canatatngt taatcctgcc 



gagagtggca gtgccagcac tggtgccagt 6 0 
gccagcacca gtggtggctt cagtgctggt 12 0 
ctcttcgctg gccttggtgg agctggtgcc 180 
ttctcctaca agtgagattt taggtatctg 24 0 
anggggcngg gataaggagc tggatgattc 300 
anagtgtgcc aagacactgc ttttggcatt 360 
gacccttnna ntttctctta cttgtgtttt 420 

440 



<210> 46 

<211> 472 

<212> DNA 

<213> Homo sapiens 

<400> 46 

gc.tctgtaat ttcacatttt aaaccttccc 

gtttctctgt tcctcttcac agcaaaaact 

ccactttctc acccccattc tcccctcaat 

tgtggctntt attanagtca ccaaccttat 

cttctcagca gcactcagct ctggtncttg 

cctcataaca atctccttcc cagcctctac 

ctaggctcag atgtagtgta gctcaaccct 

aaaaatgtcc atncntgtcc tgtgagtgat 



ttgacctcac attcctcttc ggccacctct 60 
gttcaaaaga gttgttgatt actttcattt 120 
taactctcct tcatccccat gatgccatta 180 
tctccaaaac anaagcaaca aggactttga 240 
aaacaccccc gttacttgct attcctccta 300 
tgctgccttc tctgagttct tcccagggtc 360 
gctacacaaa gnaatctcct gaaagcctgt 420 
ctnccangna naataacaaa tt 472 



<210> 47 

<211> 550 

<212> DNA 

<213> Homo sapiens 



<400> 47 

ccttcctccg cctggccatc cccagcatgc 
aggtcgggag cttcctcagt ggtctgtatg 
gggggccctg tctgggtgca aggcgacagc 
gtggagctgg gcgctcagtc catcgtgtat 
gcaggcttca gtgtggctgc cagtgtccgg 
gaagcaggca cggaagtcct ctaccgtttc 



tcatgctgtg catggagtgg tgggcctatg 60 
aggatggatg acggggactg gtgggaacct 120 
tgtctttctt caccaggcat cctcggcatg 180 
gaactggcca tcattgtgta catggtccct 240 
gtangaaacg ctctgggtgc tggagacatg 300 
cctgctgatt acagtgctct ttgctgtanc 360 
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cttcagtgtc ctgctgttaa gctgtaagga tcacntgggg tacattttta ctaccgaccg 420 

agaacatcat taatctggtg gctcaggtgg ttccaattta tgctgtttcc cacctctttg 480 

aagctcttgc tgctcaggta cacgccaatt ttgaaaagta aacaacgtgc ctcggagtqq 540 

gaattctgct yy^y-yy 

550 

<210> 48 

<211> 214 

<212> DNA 

<213> Homo sapiens 

<400> 48 

agaaggacat aaacaagctg aacctgccca ^gacgtgtga tatcagcttc tcagatccag 60 

acaacctcct caacttcaag ctggtcatct gtcctgatna gggcttctac nagagtggga 120 

agtttgtgtt cagttttaag gtgggccagg gttacccgca tgatcccccc aaggtgaagt 180 

gtgagacnat ggtctatcac cccnacattg acct 214 

<210> 49 

<2ll> 267 

<212> DNA 

<213> Homo sapiens 

<400> 49 

atctgcctaa aat-ttattca aataatgaaa atnaatctgt tttaagaaat tcagtctttt 60 
agtttttagg acaactatgc acaaatgtac gatggagaat tctttttgga tnaactctag 120 
gtngaggaac ttaatccaac cggagctntt gtgaaggtca gaanacagga gagggaatct 180 
tggcaaggaa tggagacnga gtttgcaaat tgcagctaga gtnaatngtt ntaaatggga 24 0 
ctgctnttgt gtctcccang gaaagtt 267 

<210> 50 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 50 

gactgggtca 

ggagagaacc 

ttgggatctt 

gtctagaaga 

ggagagacca 



aagctgcatg aaaccaggcc 
tgacttctct ttccctctcc 
ctgagcttgt ttccctgctg 
ggcagccctt ctttgtcctc 
anagcctctg atttttaatt 



ctggcagcaa 
ctcctccaac 
ggtgggacag 
tggggtnaat 
tccataanat 



cctgggaatg gctggaggtg 6 0 
attactggaa ctctgtcctg 12 0 
aggacaaagg agaagggagg 18 0 
gagcttgacc tanagtagat 24 0 
gttcnaagta tatntntacc 300 



<210> 51 

<2ll> 300 

<212> DNA 

<213> Homo sapiens 



<400> 51 

gggtaaaatc 

cccctatttc 

aagcgctctc 

gatcaggttn 

gggctctgtt 

<210> 52 
<211> 267 
<212> DNA 



ctgcagcacc 
tagttggtcc 
caagcacccc 
aataaatgga 
agaggggacc 



cactctggaa 
aggattaggg 
cggcctgggg 
actcttcctg 
tccaccctnn 



aatactgctc 
atgtggggta 
gtnagtttct 
tctggcctcc 
ggaagtccga 



ttaattttcc tgaaggtggc 60 
tagggcattt aaatcctctc 120 
catcccgcta ctgctgctgg 180 
aaagcagcct aaaaactgag 24 0 
ggggctnggg aagggtttct 300 
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<213> Homo sapiens 



<400> 52 

aaaatcaact tcntgcatta atanacanat 
cacctatcaa ggaacnnact tgattgcctc 
tacctgaata ccnccgcata actctcaacc 
atgctantcc cgaattcttc attatatcng 
gtatgtnccn taactgccga nncaang 



tctanancag gaagtgaana taattttctg 60 
tattnaacan atatatcgag ttnctatact 120 
nanatncntc nccatgacac tcnttcttna 180 
tgatgttcgn cctgntnata tatcagcaag 240 

267 



<210> 53 

<211> 401 

<212> DNA. 

<213> Homo sapiens 

<400> 53 

agsctttagc atcatgtaga agcaaactgc 
caagattttg tgttttctag ctgtccagga 
agagcaagtg aaaccatttc cagcctaaac 
gacctctaag gctccataat catcattaaa 
tatatacagg attaaaatca acattaaatc 
ttgagcattt taaatagtac agtaggctgg 
caaatagaaa actaaagaaa ttagataggc 

<210> 54 

<211> 401 

<212> DNA 

<213> Homo sapiens 



acctatggct gagataggtg caatgaccta 60 
aaagccatct tcagtcttgc tgacagtcaa 120 
tacataaaag cagccgaacc aatgattaaa 180 
tatgcccaaa ctcattgtga ctttttattt 240 
atcttattta catggccatc ggtgctgaaa 300 
tatacattag gaaatggact gcactggagg 360 
tggaaatgct t 401 



<400> 54 

cccaacacaa tggataaaaa cacttatagt 
aagctacaga ttgtcatagt tgttttcctg 
cagtttgacc tttgtcttct ataatatttc 
tttgattctt aactaaaatt gttctcttaa 
ggtgtatttt ctttacctcc aaggaaagaa 
gcattgtttt ggtataaggt acatattttg 
gtgcatccaa tttattatag ttttgtaagt 

<210> 55 

<211> 933 

<212> DNA 

<213> Homo sapiens 



aaatggggac attcactata atgatctaag 60 
ctttacaaaa ttgctccaga tctggaatgc 120 
ctttttttcc cctctttgaa tctctgtata 180 
atattctgaa tcctggtaat taaaagtttg 24 0 
ctactagcta caaaaaatat tttggaataa 300 
gttgaagaca ccagactgaa gtaaacagct 360 
aacaatatgt a 401 



<400> 55 

tttactgctt ggcaaagtac cctgagcatc 
tcctagggga tgggtcttct attacctggg 
tgtgcatcaa ggaatgcctc cgcctctacg 
caaacccatc acctttccag atggacgctc 
tatttgggct cttcaccaca acccctattt 
gagattctcc agggaaaatt ctgaaaaaat 
tggattaagg aactgcattg ggcagcattt 
attaactctg ctccgcttca agctggctcc 
agttgcctca agtccaagaa tggaatccat 
gtccttttcg tataagaatt aakgagacaa 
aatataatac aaaatatatg tatatggttg 
tgactggttt tgacatccac taacagtaat 



agcagagatg ccgagatgaa atcagggaac 60 
aacacctgag ccagatgcct tacaccacga 120 
caccggtagt aaactatccc ggttactcga 180 
cttacctgca ggaataactg tgtttatcaa 240 
ctgggaagac cctcaggtct ttaacccctt 300 
acatccctat gccttcatac cattctcagc 360 
tgccataatt gagtgtaaag tggcagtggc 420 
agaccactca aggccaccca gctgtcgtca 4 80 
gtgtttgcaa aaaaagtttg ctaattttaa 540 
ttttcctacc aaaggaagaa caaaaggata 600 
tttgacaaat tatataactt aggatacttc 660 
tttaatttct ttgctgtatc tggtgaaacc 720 
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cacaaaaaca cctgaaaaaa ctcaagctga 
ggtaactagt ggtagagtgg ctttcaagca 
cattactttt atctctgcaa atatctgcat 
taataaaaaa tatctgccaa aaaaaaaaaa 



gttccaatgc gaagggaaat gattggtttg 78 0 

tagtttgatc aaaactccac tcagtatctg 840 

gatagcttta ttctcagtta tctttcccca 900 

aaa 933 



<210> 56 

<211> 480 

<212> DNA 

<213> Homo sapiens 



<400> 56 

ggctttgaag 

gcagtcctgg 

gctgctccag 

gaaaccactg 

gcttctacca 

aatggtagag 

attcatgctt 

ctaatcagtt 



catttttgtc 
tactcttggg 
ctgacacgta 
ctgctgcaac 
ctgctcgtaa 
tgtgtccctg 
cctgtgattt 
cattttcttt 



tgtgctccct 
agtttccatc 
tccagctact 
cactgcgacc 
agacattcca 
agatggaatc 
catccaacta 
caaataaaaa 



gatct tcagg 
tttctggtct 
ggtcctgctg 
actgctgctc 
gttttaccca 
agcttgagtc 
cttaccttgc 
ataactatga 



tcaccaccat 
ctgcccagaa 
atgatgaagc 
ctaccactgc 
aatgggttgg 
ttctgcaatt 
ctacgatatc 
gcaacaaaaa 



gaagttctta 60 
tccgacaaca 120 
ccctgatgct 180 
aaccaccgct 240 
ggatctcccg 300 
ggtcacaact 360 
ccctttatct 420 
aaaaaaaaaa 480 



<210> 57 

<211> 798 

<212> DNA 

<213> Homo sapiens 



<400> 57 

agcctacctg 

gactaacttt 

agatgaccag 

gatggtattg 

tgttactgcc 

catggggctt 

tgattataat 

tatatatttt 

aagttatttg 

tttaccttgg 

tcttggattg 

ggctagaata 

aaaattttta 

aaaaaaaaaa 



gaaagccaac 
gtgatatggg 
agtactctta 
ttttcatgag 
attcctattt 
ttttggttgt 
ttttgtgata 
aaatagattt 
gggttgtctg 
tagtttattt 
ttttgcaaat 
ggaagagaga 
atgttaagaa 
aaaaaaaa 



cagtcctcat 
aagtgaaaat 
accccttaga 
cttctagaaa 
acagtatatt 
cctaaactta 
atttctggcc 
gataggtttt 
ggattgtgtg 
gtggatggca 
tacagctgaa 
aaaaatgaaa 
aaccttaaat 



aatggacaag 
agttaacacc 
actgtttttc 
tttcacttgc 
tgagtgaatg 
caaacattcc 
tgattgaagg 
taaattgctt 
aaagaaaatt 
gttttctgta 
atctgtgtca 
tggttgttta 
aaacatgatt 



atccaccagc 
ttgcacgacc 
cttttgtatc 
aagtttattt 
attatatttt 
actcattctg 
aaatttgaga 
tttttcataa 
agaaccccgc 
gttttgggga 
tggattaaac 
ctaattttat 
gatcaatatg 



tcctcctgtg 
aaacgaacga 
tgcaatatgg 
ttgcttcctg 
taaaaagtta 
tttgtaactg 
ggtct.gcatt 
ggtatttata 
tgtatttaca 
ctgtggtagc 
tggcttatgt 
actcccatta 
gaaaaaaaaa 



60 

120 

180 

240 

300 

36 0 

420 

480 

540 

600 

660 

720 

780 

798 



<210> 58 

<211> 280 

<212> DNA 

<213> Homo sapiens 

<400> 58 

99g9cagctc ctgaccctcc acagccacct 
gaggtcccac tgagcctctc gcctgccccc 
cccctgcctg gtcccccaca aggactccca 
tggaccatgg tcgtgaggaa gggctcatgc 
cagaataaac cgagaaggaa accagaaaaa 



ggtcagccac cagctggggc aacgagggtg 6 0 
gccactcgtc tggtgcttgt tgatccaagt 12 0 
tccaggcccc ctctgccctg ccccttgtca 180 
cccttattta tgggaaccat ttcattctaa 240 
aaaaaaaaaa 280 



<210> 59 
<211> 382 
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<212> DNA 

<213> Homo sapiens 



<400> 59 

aggcgggagc agaagctaaa gccaaagccc 

gtaccagtac caataacagt gccagtgcca 

gtgccagcct gaccgccact ctcacatttg 

ccagcaccag tggcagctct ggtgcctgtg 

gttaatcctg ccagtctttc tcttcaagcc 

agcactctag gcagccacta tcaatcaacc 

ccattaaaaa aaaaaaaaaa aa 

<210> 60 

<211> 602 

<212> DNA 

<213> Homo sapiens 



aagagagtgg cagtgccagc actggtgcca 60 

gtgccagcac cagtggtggc ttcagtgctg 120 

ggctcttcgc tggccttggt ggagctggtg 180 

gtttctccta caagtgagat tttagatatt 240 

agggtgcatc ctcagaaacc tactcaacac 300 

gaagttgaca ctctgcatta aatctatttg 360 

382 



<400> 60 

tgaagagccg cgcggtggag ctgctgcccg 

ttgtggtgga gaatagtgcc cagcgggtca 

gggtcccatc ctcgtgagta ccgccactcc 

cttctcgacg gctggcagag atccaagaac 

aggcccgcag gaaggaggag gtctataagc 

gagatgtgtc ccggctggcc tacacccagc 

agcagaagga agagatcacc aagatcttgt 

actccctatc tgggaagctg gaccggacgt 

atgccaagaa ggacgatgct gttcggaagg 

actgcagcca gctcatccag accatcgagg 
ac 



stgggactgc caaccttgcc aagctgcagc 6 0 
tccacttggc gggtcagtgg gagaagcacc 120 
gaaagctgca ggattgcaga gagctggaat 180 
tgcaccagag tgtccgggcg gctgctgaag 24 0 
agctgatgtc agagctggag actctgccca 300 
gcatcctgga gatcgtgggc aacatccgga 36 0 
ctgatacgaa ggagcttcag. aaggaaatca 420 
ttgcggtgac tgatgagctt gtgttcaagg 480 
cctataagta tctagctgct ctgcacgaga 540 
acacaggcac catcatgcgg gaggttcgag 6 00 

602 



<210> 61 

<211> 1368 

<212> DNA 

<213> Homo sapiens 



<400> 61 

ccagtgagcg cgcgtaatac gactcactat 
agcggccgcc cttttttttt tttttttatt 
tgaaaacagc taaaacttaa ttccaagcat 
gttcacaaaa cacattcaat gtttaaacac 
attgtttgct tctaaaaagc ccatttccct 
gcaatgaagt tattttgtac aatctagtta 
aaattgcaca gacgtatgca attttccaag 
ccaaagattg gttgcccata atagacttaa 
tacgaaaatg taaaaaagga aatgtaagtc 
ggatgctaaa gcaaaataaa tgtaggttct 
gcttctttga tatgccttag ggttacccat 
aatgaaaatt gatcaaatat acaccttgtc 
ccaaaaaaag ggtaggcatg aagaaaaaaa 
kgktctgata tggcagacar gatacaagtc 
ggtaatgggc tgacaaggta ttattgccag 
gtttttggat tatatagcac ctagagtctc 
acgctaaact tccaagggaa aatctttcag 
agttgcattg ctactgtgat tttttgaaaa 
agagagaaag attttagtct gtttagcaga 



sg99cgaatt gggtaccggg ccccccctcg 60 
gatcagaatt caggctttat tattgagcaa 120 
gtgtagttaa agtttgcaaa gtgggatatt 18 0 
tatttatttg aagaacaaaa tatatttaaa 240 
ccaagtctaa actttgtaat ttgatattaa 300 
aacaagcaga atagcactag gcagaataaa 360 
atagcattct ttaaattcag ttttcagctt 420 
acatataatg atggctaaaa aaaataagta 480 
cactctcaat ctcataaaag gtgagagtaa 540 
ttttttctgt ttccgtttat catgcaatct 600 
ttaagttaga ggttgtaatg caatggtggg 660 
atttcatttc aaattgcggg ctggaaactt 720 
aaatcmaatc agaacctctt caggggtttg 780 
ccaccaggag atggagcaat tcaaaataag 840 
catgggacag aatgagcaac aggctgaaaa 900 
tgatgtaggg aatttttgtt agtcaaacat 960 
gtagcctaag cttg.cttttc tagagtgatg 1020 
caaactgggt ttgtacaagt gagaaagact 1080 
agccatttta tctgcgtgca catggatcaa 114 0 
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tatttctgat cccctatacc ccaggaaggg caaaatccca aagaaatgtg ttagcaaaat 1200 

tggctgatgc tatcatattg ctatggacat tgatcttgcc caacacaatg gaattccacc 1260 

acactggact agtggatcca ctagttctag agcggccggc caccgcggtg gagctccagc 1320 

ttttgttccc tttagtgagg gttaattgcg cgcttggcgt aatcatnn 1368 

<210> 62 
<211> 924 
<212> DNA 

<213> Homo sapiens 



<400> 62 

caaaggnaca 

gcctccaaaa 

tntnaatgtn 

tnccgnrtnc 

accttaaaga 

aatattaata 

atgtgtatca 

tacttgtttg 

tgagttttct 

ttatggtcaa 

gtgaagcagg 

ttaagtaatt 

tcttccagca 

tcctggtgct 

gggaatcact 

tgtgaaggca 



ggaacagctt 
gcaataggaa 
tcargnanar 
mgnactatnt 
actccgttaa 
ctatcatttt 
tggttcctaa 
aagctaaaga 
agtttggaag 
gtaactgggc 
aactatgatt 
ctcccagggt 
ttgaagaatc 
tgcaccaacc 
gcacgtcgca 
aaaaaaaaaa 



gnaaagtact 
atttaaaaga 
tnccttncaa 
caggtttgaa 
tttctcagag 
tccaaggatt 
taaacatcca 
cctcacaata 
agctattaaa 
tcagcatttt 
atattgacta 
cacacagcta 
ataaatgtaa 
cactagcact 
agagatgttg 
aaaa 



gncatncctn 
tttncactga 
atgncrnctn 
aaactggatc 
cctcagtttc 
gatgtgaaca 
aaatatagta 
gaatcccatc 
taacaacktc 
acattcattg 
cataaatgaa 
gaactggcaa 
ataactgcaa 
tgttctctac 
cttctgatga 



cctgcaggga 
gaaggggncc 
cactnactnr 
tgccacttat 
cttgtctata 
ttaatgaggt 
cttactattg 
cagcccacca 
tagtgtcaat 
tctctttaag 
gaaattgagg 
agcctgggat 
ggccttttcc 
aggggaacat 
attattgttc 



ccagcccttt 60 
acgtttnart 120 
gnatttgggt 180 
cagttatgtg 240 
^gttgggagt 3 00 
gaaatgacag 360 
tcattattat 420 
gacagagytc 480 
tctatacttg 540 
ttctagcaat 600 
ctcagataca 660 
tgatccatga 720 
tcagaagagc 780 
ctgtgggcct 84 0 
ctgtcagtgg 900 
924 



<210> 63 

<2ll> 1079 

<212> DNA 

<213> Homo sapiens 

<400> 63 

agtcccaaga actcaataat ctcttatgtt 
tatttcggtg cctgaatgga aaaatataaa 
tggaatccag ctggcagcta taagcaccgt 
tattaaatgg cctcacatcc tgaatgcagg 
taatgttgaa ttctgaaaac acaaccataa 
tagtacatta tttcctccac agcaaaccta 
acaatcaggg caaaacccac acttgaaaag 
gaagacccca gtgatcacta ggaaatctac 
caaacttcgg ggaataatgt gtccctcttc 
acgaagttta caagcagcag ttattccaag 
ctggcaatgt ttaggtttgc ccaaaaactc 
accacatctg gtaacctctc gatcccttag 
tgactctgga gcctcttgca ttttctttaa 
agcatgccct ctggtgctct ccaaatggga 
attcacatgc acacataaaa ggtttctcat 
ttggcttttt aatttcactc ttgatttctt 
atgacctgta ataatctcat aattacttga 
aataacttcc tgtagaaata tcacatctgg 

<210> 64 
<211> 1001 



ttcttttgaa gacttatttt aaatattaac 60 
cattagctca gagacaatgg ggtacctgtt 120 
tgaaaactct gacaggcttt gtgccctttt 180 
aatgtgttcg tttaaataaa cattaatctt 240 
atcatagttg gtttttctgt gacaatgatc 3 00 
cctttccaga aggtggaaat tgtatttgca 360 
cattttacaa tattatatct aagttgcaca 420 
cacagtccag tttttctaat ccaagaaggt 480 
tgctgctgct ctgaaaaata ttcgatcaaa 54 0 
attagagttc atttgtgtat cccatgtata 600 
ccagacatcc acaatgttgt tgggtaaacc 660 
atttgtatct cctgcaaata taactgtagc 720 
aaccattttt aactgattca ttcgttccgc 780 
tgtcataagg caaagctcat ttcctgacac 840 
cattttggta cttggaaaag gaataatctc 900 
caacattata gctgtgaaat atccttcttc 960 
tctcttcttt aggtagctat aatatggggg 1020 
gctgtacaaa gctaagtagg aacacaccc 1079 
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<212> DNA 

<213> Homo sapiens 
<400> 64 

gaatgtgcaa cgatcaagtc agggtatctg tggtatccac cactttgagc atttatcgat 60 
tctatatgtc aggaacattt caagttatct gttctagcaa ggaaatataa aatacttata 120 
gttaactatg gcctatctac agtgcaacta aaaactagat tttattcctt tccacctgtg 180 
ggtttgtatt catttaccac cctcttttca ttccctttct cacccacaca ctgtgccggg 240 
cctcaggcat atactattct actgtctgtc tctgtaagga ttatcatttt agcttccaca 300 
tatgagagaa tgcatgcaaa gtttttcttt ccatgtctgg cttatttcac ttaacataat 360 
gacctccgct tccatccatg ttatttatat tacccaatag tgttcataaa tatatataca 420 
cacatatata ccacattgca tttgtccaat tattcattga cggaaactgg ttaatgttat 480 
atcgttgcta ttgtggatag tgctgcaata aacacgcaag tggggatata atttgaagag 54 0 
tttttttgtt gatgttcctc caaattttaa gattgttttg tctatgtttg tgaaaatggc 600 
gttagtattt tcatagagat tgcattgaat ctgtagattg ctttgggtaa gtatggttat 660 
tttgatggta ttaatttttt cattccatga agatgagatg tctttccatt gtttgtgtcc 720 
tctacatttt ctttcatcaa agttttgttg tatttttgaa gtagatgtat ttcaccttat 780 
agatcaagtg tattccctaa atattttatt tttgtagcta ttgtagatga aattgccttc 840 
ttgatttctt tttcacttaa ttcattatta gtgtatggaa atgttatgga tttttatttg 900 
ttggttttta atcaaaaact gtattaaact tagagttttt tgtggagttt ttaagttttt 960 
ctagatataa gatcatgaca tctaccaaaa aaaaaaaaaa a lOOi 

<210> 65 

<211> 575 

<212> DNA 

<213> Homo sapiens 

<400> 65 

acttgatata aaaaggatat ccataatgaa tattttatac tgcatccttt acattagcca 60 
ctaaatacgt tattgcttga tgaagacctt tcacagaatc ctatggattg cagcatttca 120 
cttggctact tcatacccat gccttaaaga ggggcagttt ctcaaaagca gaaacatgcc 180 
gccagttctc aagttttcct cctaactcca tttgaatgta agggcagctg gcccccaatg 240 
tggggaggtc cgaacatttt ctgaattccc attttcttgt tcgcggctaa atgacagttt 300 
ctgtcattac ttagattccc gatctttccc aaaggtgttg atttacaaag aggccagcta 360 
atagccagaa atcatgaccc tgaaagagag atgaaatttc aagctgtgag ccaggcagga 420 
gctccagtat ggcaaaggtt cttgagaatc agccatttgg tacaaaaaag atttttaaag 480 
cttttatgtt ataccatgga gccatagaaa ggctatggat tgtttaagaa ctattttaaa '540 
gtgttccaga cccaaaaagg aaaaaaaaaa aaaaa 575 

<210> 66 

<211> 831 

<212> DNA 

<213> Homo sapiens 

<400> 66 

attgggctcc ttctgctaaa cagccacatt gaaatggttt aaaagcaagt cagatcaggt 60 
gatttgtaaa attgtattta tctgtacatg tatgggcttt taattcccac caagaaagag 120 
agaaattatc tttttagtta aaaccaaatt tcacttttca aaatatcttc caacttattt 180 
attggttgtc actcaattgc ctatatatat atatatatat gtgtgtgtgt gtgtgtgcgc 24 0 
gtgagcgcac gtgtgtgtat gcgtgcgcat gtgtgtgtat gtgtattatc agacataggt 300 
ttctaacttt tagatagaag aggagcaaca tctatgccaa atactgtgca ttctacaatg 360 
gtgctaatct cagacctaaa tgatactcca tttaatttaa aaaagagttt taaataatta 420 
tctatgtgcc tgtatttccc ttttgagtgc tgcacaacat gttaacatat tagtgtaaaa 480 
gcagatgaaa caaccacgtg ttctaaagtc tagggattgt gctataatcc ctatttagtt 54 0 
caaaattaac cagaattctt ccatgtgaaa tggaccaaac tcatattatt gttatgtaaa 600 
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tacagagttt taatgcagta tgacatccca caggggaaaa gaatgtctgt agtgggtgac 660 

tgttatcaaa tattttatag aatacaatga acggtgaaca gactggtaac ttgtttgagt 720 

tcccatgaca gatttgagac ttgtcaatag caaatcattt ttgtatttaa atttttgtac 780 

tgatttgaaa aacatcatta aatatcttta aaagtaaaaa aaaaaaaaaa a 831 

<210> 67 

<21I> 590 

<212> DNA 

<213> Homo sapiens 



<400> 67 

gtgctctgtg 

ttaaaggctc 

ttctccctgg 

taaaaatttg 

ttgaattaaa 

ttaaaacaca 

ttgttccttt 

caaagttttc 

gaagcagact 

aaattccttt 



tattttttta 
tttgtgacca 
attagcagtt 
ccttaatgta 
gaaaaaaaaa 
tttcaaatag 
tccctgtgcc 
tgtagttgtg 
tttcttttaa 
ttatggctta 



ctgcattaga 
tgtttccctt 
taaatgaaac 
tcagttcagc 
ttctcaatca 
aagtgagttt 
tgtgtcaaat 
ttagttcttt 
aagaattatt 
tatacctaca 



cattgaatag 
tgtagcaata 
agagttcatc 
tcacaagtat 
tatttttaaa 
gaactgacct 
cttcaagtct 
tgtcatgtct 
tctctttcaa 
tatttaaaaa 



taatttgcgt 
aaatgttttt 
aatgaaatga 
tttaagatga 
atataagact 
tatttatact 
tgctgaaaat 
gtttttggct 
atatttctat 
aaaaaaaaaa 



taagatacgc 
tacgaaaact 
gtatttaaaa 
ttgagaagac 
aaaattgttt 
ctttttaagt 
acatttgata 
gaagaaccaa 
cctttttaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

590 



<210> 68 

<211> 301 
<212> DNA 
<-213> Homo sapiens 



<400> 68 

ttgtgttggg gttccctttt ccggtcggcg 
cgggcctgca ccatgagcgt cccggccttc 
gagcttcgtg cttatctgaa atctaaagga 
ggacttcatg ttgatttagc tcaaattatt 
gataaagatg ttgaaagtgt gatgaacagt 
c 



tggtcttgcg 


agtggagtgt 


ccgctgtgcc 


60 


atcgacatca 


gtgaagaaga 


tcaggctgct 


120 


gctgagattc 


cagaagagaa 


ctcggaaggt 


180 


gaagcctgtg 


atgtgtgtct 


gaaggaggat 


240 


ggggnatcct 


actcttgatc 


cggaanccna 


300 
301 



<210> 69 

<211> 301 

<212> DNA 

<213> Homo sapiens 



<400> 69 

tctatgagca tgccaaggct ctgtgggagg 
ccaacgagta ccagctgatt gactgtgccc 
agcaggctga ctatgtgccg agcgatcagg 
gaatctttga gaccaagttc caggtggacn 
gccagcgcga tgaacgccgc aagtggatcc 
t 



atgaaggagt 


gcgtgcctgc 


tacgaacgct 


60 


agtacttcct 


ggacaagatc 


gacgtgatca 


12 0 


acctgcttcg 


ctgccgtgtc 


ctgacttctg 


180 


aagtcaactt 


ccacatgntt 


gacgtgggtg 


240 


agtgcttcaa 


cgatgtgact 


gccatcatct 


300 
301 



<210> 70 

<211> 201 

<212> DNA 

<213> Homo sapiens 



<400> 70 

gcggctcttc ctcgggcagc ggaagcggcg cggcggtcgg agaagtggcc taaaacttcg 



60 
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gcgttgggtg aaagaaaatg gcccgaacca agcagactgc tcgtaagtcc accggtggga 
aagccccccg caaacagctg gccacgaaag ccgccaggaa aagcgctccc tctaccggcg 
gggtgaagaa gcctcatcgc t 



120 
180 
201 



<210> 71 
<211> 301 
<212> DNA 

<213> Homo sapiens 
<400> 71 

gccggggtag tcgccgncgc cgccgccgct gcagccactg caggcaccgc tgccgccgcc 6 0 

tgagtagtgg gcttaggaag gaagaggtca tctcgctcgg agcttcgctc ggaagggtct 12 0 

ttgttccctg cagccctccc acgggaatga caatggataa aagtgagctg gtacanaaag 18 0 

ccaaactcgc tgagcaggct gagcgatatg atgatatggc tgcagccatg aaggcagtca 240 

cagaacaggg gcatgaactc ttcaacgaag agagaaatct gctctctggt gcctacaaga 300 

a 301 

<210> 72 
<211> 251 
<212> DNA 

<213> Homo sapiens 
<400> 72 

cttggggggt gttgggggag agactgtggg cctggaaata aaacttgtct cctctaccac 6 0 

caccctgtac cctagcctgc acctgtccac atctctgcaa agttcagctt ccttccccag 120 

gtctctgtgc actctgtctt ggatgctctg gggagctcat gggtggagga gtctccacca 180 

gagggaggct caggggactg. gttgggccag ggatgaatat ttgagggata aaaattgtgt 24 0 

aagagccaan g 251 



<210> 73 
<211> 913 
<212> DNA 

<213> Homo sapiens 
<400> 73 

tttttttttt tttttcccag gccctctttt tatttacagt gataccaaac catccacttg 60 

caaattcttt ggtctcccat cagctggaat taagtaggta ctgtgtatct ttgagatcat 120 

gtatttgtct ccactttggt ggatacaaga aaggaaggca cgaacagctg aaaaagaagg 180 

gtatcacacc gctccagctg gaatccagca ggaacctctg agcatgccac agctgaacac 240 

ttaaaagagg aaagaaggac agctgctctt catttatttt gaaagcaaat tcatttgaaa 3 00 

gtgcataaat ggtcatcata agtcaaacgt atcaattaga ccttcaacct aggaaacaaa 360 

attttttttt tctatttaat aatacaccac actgaaatta tttgccaatg aatcccaaag 420 

atttggtaca aatagtacaa ttcgtatttg ctttcctctt tcctttcttc agacaaacac 480 

caaataaaat gcaggtgaaa gagatgaacc acgactagag gctgacttag aaatttatgc 540 

tgactcgatc taaaaaaaat tatgttggtt aatgttaatc tatctaaaat agagcatttt 600 

gggaatgctt ttcaaagaag gtcaagtaac agtcatacag ctagaaaagt ccctgaaaaa 660 

aagaattgtt aagaagtata ataacctttt caaaacccac aatgcagctt agttttcctt 720 

tatttatttg tggtcatgaa gactatcccc atttctccat aaaatcctcc ctccatactg 780 

ctgcattatg gcacaaaaga ctctaagtgc caccagacag aaggaccaga gtttctgatt 840 

ataaacaatg atgctgggta atgtttaaat gagaacattg gatatggatg gtcagcccaa 900 

cacaatggaa ttc 913 



wo 99/33869 



21 



PCT/US98/274I6 



<210> 74 
<211> 351 
<212> DNA 

<213> Homo sapiens 



<400: 
tgtgcncagg 

tgggtttgaa 

aggaaggagn 
nctagctgac 
gactagngga 
cctttagnga 



^ 74 

ggatgggtgg 
ggccanggcc 
catggcaagc 
catcactgtt 
nccactagtt 
gggttaattg 



gcngtggaga 
Qsggggncct 
catagctagg 
ggtgnccagt 
ctagagcggc 
cgcgcttggc , 



ngatgacaga 
caggtccgnt 
ccaccaatca 
ttcccaacac 
cgccaccgcg 
ntaatcatgg 



<210> 75 

<211> 251 

<212> DNA 

<213> Homo sapiens 



aaggctggaa 
tctgnnaagg 
gattaagaaa 
aatggaatnc 
gtggaacccc 
tcataagctg 



ggaanggggg 
gacagccttg 
nnctgagaaa 
caccacactg 
-aacttttgcc 
t 



60 
120 
180 
240 
300 
351 



<400> 75 

tacttgacct tctttgaaaa gcattcccaa aatgctctat tttagataga ttaacattaa 
ccaacataat tttttttaga tcgagtcagc ataaatttct aagtcagcct ctagtcgtgg 
ttcatctctt tcacctgcat tttatttggt gtttgtctga agaaaggaaa gaggaaagca 
ggtgtattat f^""^^^^^^ taccaaatct ttgggattca ttggcaaata atttcagtgt 

<210> 76 
<211> 251 
<212> DNA 

<213> Homo sapiens 



<400> 76 
tatttaataa tacaccacac 
tagtacaatt cgtatttgct 
aggtgaaaga gatgaaccac 
aaaaaaatta tgttggttaa 
caaagaaggt c 



tgaaattatt tgccaatgaa tcccaaagat ttggtacaaa 
ttcctctttc ctttcttcag acaaacacca aataaaatgc 
gactagaggc tgacttagaa atttatgctg actcgatcta 
tgttaatcta tctaaaatag agcattttgg gaatgctttt 



60 
120 
180 
240 
251 



60 
120 
180 
240 



<210> 77 
<211> 351 
<212> DNA 

<213> Homo sapiens 



<400 
actcaccgtg 
gaggcgggag 
tatctctatg 
caaaaattct 
cagaagccca 
aaatggactt 



> 77 
ctgtgtgctg 
gcatgagtga 
actcagaaac 
ttggcctacc 
gatgtggagt 
ccaaagtggt 



tgtgcctgct 
gctacagtgg 
aaaaaatgcc 
tataactgga 
gccagatgtt 
cacctacagg 



gcctggcagc 
gaacaggctc 
aacagtttag 
atgttaaact 
gcagaatact 
atcgtatcat 



<210> 78 
<211> 1592 
<212> DNA 

<213> Homo sapiens 



ctggccctgc 


cgctgctcag 


60 


aggactatct 


caagagannn 


120 


aagccaaact 


caaggagatg 


180 


cccgcgtcat 


agaaataatg 


240 


cactatttcc 


aaatagccca 


300 


atactcgaga 


c 


351 
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<400> 78 

gaattccatt gtgttggggc cctgggggcg gaggggaggg gcccaccacg gccttatttc 60 

cgcgagcgcc ggcactgccc gctccgagcc cgtgtctgtc gggtgccgag ccaactttcc 12 0 

tgcgtccatg cagccccgcc ggcaacggct gcccgctccc tggtccgggc ccaggggccc 180 

gcgccccacc gccccgctgc tcgcgctgct gctgttgctc gccccggtgg cggcgcccgc 24 0 

ggggtccggg gaccccgacg accctgggca gcctcaggat gctggggtcc cgcgcaggct 3 00 

cctgcagcag gcggcgcgcg cggcgcttca cttcttcaac ttccggtccg gctcgcccag 360 

cgcgctgcga gtgctggccg aggtgcagga gggccgcgcg tggattaatc caaaagaggg 420 

atgtaaagtt cacgtggtct tcagcacaga gcgctacaac ccagagtctt tacttcagga 480 

aggtgaggga cgtttgggga aatgttctgc tcgagtgttt ttcaagaatc agaaacccag 54 0 

accaactatc aatgtaactt gtacacggct catcgagaaa aagaaaagac aacaagagga 600 

ttacctgctt tacaagcaaa tgaagcaact gaaaaacccc ttggaaatag tcagcatacc 660 

tgataatcat ggacatattg atccctctct gagactcatc tgggatttgg ctttccttgg 720 

aagctcttac gtgatgtggg aaatgacaac acaggtgtca cactactact tggcacagct 780 

cactagtgtg aggcagtgga aaactaatga tgatacaatt gattttgatt atactgttct 840 

acttcatgaa ttatcaacac aggaaataat tccctgtcgc attcacttgg tctggtaccc 900 

tggcaaacct cttaaagtga agtaccactg tcaagagcta cagacaccag aagaagcctc 960 

cggaactgaa gaaggatcag ctgtagtacc aacagagctt agtaatttct aaaaagaaaa 1020 

aatgatcttt ttccgacttc taaacaagtg actatactag cataaatcat tcttctagta 1080 

aaacagctaa ggtatagaca ttctaataat ttgggaaaac ctatgattac aagtaaaaac 1140 

tcagaaatgc aaagatgttg gttttttgtt tctcagtctg ctttagcttt taactctgga 1200 

agcgcatgca cactgaactc tgctcagtgc taaacagtca ccagcaggtt cctcagggtt 1260 

tcagccctaa aatgtaaaac ctggataatc agtgtatgtt gcaccagaat cagcattttt 1320 

tttttaactg caaaaaatga tggtctcatc tctgaattta tatttctcat tcttttgaac 1380 

atactatagc taatatattt tatgttgcta aattgcttct atctagcatg ttaaacaaag 1440 

ataatatact ttcgatgaaa gtaaattata ggaaaaaaat taactgtttt- aaaaagaact 1500 

tgattatgtt ttatgatttc aggcaagtat tcatttttaa cttgctacct acttttaaat 1560 

aaatgtttac atttctaaaa aaaaaaaaaa aa 1592 



<210> 79 

<211> 401 

<212> DNA 

<213> Homo sapiens 



<400> 79 

catactgtga attgttcttg actccttttc ttgacattca gttttcanaa tttccatctt 60 

tcttctggaa ctaatgtgct gttctcttga ctgcctgctg ggccagcatc cgattgccag 120 

ccagaaacgt cacactgccc aagatggcca ggtacttcaa ggtctggaac atgttgagct 180 

gagtccagta gacatacatg agtcccagca tagcagcatg tcccaggtga aatataatcg 240 

tgctaggagc aaaagtgaag ttggagacat tggcaccaat ccggatccac tagttctaga 3 00 

gcggccgcca ccgcggtgga gctccagctt ttgttccctt tagtgagggt taattgcgcg 360 

cttggcgtaa tcatggncat agctgtttcc tgtgtgaaat t 4 01 

<210> 80 

<211> 301 

<212> DNA 

<213> Homo sapiens 

<400> 80 

aaaaatgaaa catctatttt agcagcaaga ggctgtgagg gatggggtag aaaaggcatc 60 

ctgagagagt tctagaccga cccaggtcct gtggcacact atacgggtca ggaggggtgg 12 0 

aagacaggcc taagctctag gacggtgaat ctcggggcta tttgtggatt tgttagaaac 180 

agacattctt ttggcctttt cctggcactg gtgttgccgg caggtgggca gaagtgagcc 24 0 

accagtcact gttcagtcat tgccaccaca gatcttcagc agaatcttcc ggtaatcccc 300 

t 301 
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<210> 81 

<211> 301 

<212> DNA 

<213> Homo sapiens 



<400> 81 

tagccaggtt gctcaagcta attttattct ttcccaacag 
agcctttaga atgtggcagc aagagaaagc ggactacgca 
agctctcctg gtgttgactt agggatgaag gctccaggct 
cagcagaaga actgntttct ctgataagga tgtcccacca 
gttacacagg tccttcttgc agcagtaagt accgttagct 
t 



gatccatttg 
ggaacgggga 
gctgccagaa 
ttttcaagct 
cattttccct 



gaaaatatca 
gtttgggaga 
atggagtcac 
gttcgttaaa 
caagcgggtt 



<210> 82 

<211> 201 

<212> DNA 

<213> Homo sapiens 

<400> 82 

tcaacagaca aaaaaagttt attgaataca aaactcaaag gcatcaacag tcctgggccc 
aagagatcca tggcaggaag tcaagagttc tgcttcaggg tcggtctggg cagccctgga 
agaagtcatt gcacatgaca gtgatgagtg ccaggaaaac agcatactcc tggaaagtcc 
acctgctggn cactgnttca t 

<210> 83 

<211> 251 

<212> DNA 

<213> Homo sapiens 

<400> 83 

gtaaggagca tactgtgccc atttattata gaatgcagtt aaaaaaaata ttttgaggtt 
agcctctcca gtttaaaagc acttaacaag aaacacttgg acagcgatgc aatggtctct 
cccaaaccgg ctccctctta ccaagtaccg taaacagggt ttgagaacgt tcaatcaatt 
tcttgatatg aacaatcaaa gcatttaatg caaacatatt tgcttctcaa anaataaaac 
cattttccaa a 

<210> 84 

<211> 301 

<212> DNA 

<213> Homo sapiens 



<400> 84 

agtttataat gttttactat gatttagggc ttttttttca 
taaaaactca ggtatcagaa agactcaaaa ggctgttttt 
tccaggcatt aagtgtgtca tacagttgtt gccactgctg 
tgctatgact gacaactact tttctctggg tctgatcaat 
ttcttacggc gtcnataaca aatgcttcaa catcatcagc 
c 

<210> 85 

<211> 201 

<212> DNA 

<213> Homo sapiens 



aagaacaaaa 
cactttgttc 
ttttccaaat 
tttgcagtan 
tccaatctga 



attataagca 
agattttgtt 
gtccgatgtg 
accattttag 
agtcttgctg 



60 
120 
180 
240 
300 
301 



60 
120 
180 
201 



60 
120 
180 
240 
251 



60 
120 
180 
240 
300 
301 
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<400> 85 



tatttgtgta tgtaacattt attgacatct acccactgca agtatagatg aataagacac 
agtcacacca taaaggagtt tatccttaaa aggagtgaaa gacattcaaa aaccaactgc 
aataaaaaag ggtgacataa ttgctaaatg gagtggagga acagtgctta tcaattcttg 
attgggccac aatgatatac c 



60 
120 
180 
201 



<210> 86 
<211> 301 
<212> DNA 

<213> Homo sapiens 
<400> 86 

tttataaaat attttattta cagtagagct ttacaaaaat agtcttaaat taatacaaat 60 
cccttttgca atataactta tatgactatc ttctcaaaaa cgtgacattc gattataaca 120 
cataaactac atttatagtt gttaagtcac cttgtagtat aaatatgttt tcatcttttt 180 
tttgtaataa ggtacatacc aataacaatg aacaatggac aacaaatctt attttgntat 24 0 

tcttccaatg taaaattcat ctctggccaa aacaaaatta accaaagaaa agtaaaacaa 300 
t 301 

<210> 87 

<211> 351 

<212> DNA 

<213> Homo sapiens 

<400> 87 

aaaaaagatt taagatcata aataggtcat tgttgtcaca acacatttca gaatcttaaa 60 

aaaacaaaca ttttggcttt ctaagaaaaa gacttttaaa aaaaatcaat tccctcatca 120 

ctgaaaggac ttgtacattt ttaaacttcc agtctcctaa ggcacagtat ttaatcagaa 180 

tgccaatatt accaccctgc tgtagcanga ataaagaagc aagggattaa cacttaaaaa 240 

aacngccaaa ttcctgaacc aaatcattgg cattttaaaa aagggataaa aaaacnggnt 300 

aaggggggga gcattttaag taaagaangg ccaagggtgg tatgccngga c 351 

<210> 88 

<211> 301 

<212> DNA 

<213> Homo sapiens 

<400> 88 

gttttaggtc tttaccaatt tgattggttt atcaacaggg catgaggttt aaatatatct 60 

ttgaggaaag gtaaagtcaa atttgacttc ataggtcatc ggcgtcctca ctcctgtgca 120 

ttttctggtg gaagcacaca gttaattaac tcaagtgtgg cgntagcgat gctttttcat 180 

ggngtcattt atccacttgg tgaacttgca cacttgaatg naaactcctg ggtcattggg 240 

ntggccgcaa gggaaaggtc cccaagacac caaaccttgc agggtacctn tgcacaccaa 3 00 



<210> 89 

<211> 591 

<212> DNA 

<213> Homo sapiens 

<400> 89 

tttttttttt tttttttatt aatcaaatga ttcaaaacaa ccatcattct gtcaatgccc 60 

aagcacccag ctggtcctct ccccacatgt cacactctcc tcagcctctc ccccaaccct 12 0 

gctctccctc ctcccctgcc ctagcccagg gacagagtct aggaggagcc tggggcagag 18 0 

ctggaggcag gaagagagca ctggacagac agctatggtt tggattgggg aagagattag 24 0 



c 



301 
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gaagtaggtt 
ttattcaaat 
tggctcttac 
cctccctctg 
acagagacct 
acagggtggt 



cttaaagacc 
catttcccat 
acaattttta 
gtggagactc 
ggggaaggaa 
ggtagaggag 



cttttttagt 
agcccagctc 
tccctcaaat 
ctccacccat 
gctgaacttt 
acaagtttta 



accagatatc 
ctctctgttc 
attcatccct 
gagctcccca 
gcagagatgt 
tttccaggcc 



cagccatatt 
tccccctact 
ggcccaacca 
gagcatccaa 
ggacaggtgc 
cacagtctct 



cccagctcca 
accaattctt 
gtcccctgag 
gacagagtgc 
aggctagggt 



<210> 90 

<211> 1996 

<212> DNA 

<213> Homo sapiens 

<400> 90 

tttttttttt ttttttatca aatgaatact 
tttcttttca tcatggagtt accagatttt 
gctaagacat gttaaattct taaatgccat 
tcacaagtct agaatgtgat taagctacaa 
tggtttgtaa caatctagaa gcaatctgtt 
aaccaattta atgccaccaa acataagcct 
ctaaattcta gcagagtaaa cgattccaac 
ttatgacttt gtaatatgta attcataata 
aaaccaaagt agtaggatat tatagaaaag 
gatttcctca aaccttttgt ttttcctcat 
caagatctaa cagggttctt tctagaggat 
acggatcatg ccactcattc atggttgttc 
atacatggca ctccaagggg atgcttcagc 
atacaaaagc cctcgtgtgg tgggaactgt 
aacagtttgg cacctttcca tgacgaggat 
gctattcagc cagaaatagt ttttgtgctg 
accagtgcaa aaacattaca gatgtctggg 
tttatattta atactctatt caaatttaac 
aacttaattc ttctgagagc tctatttgga 
gtatattcat gcacagggca aacaactgta 
catcagtagc tagataccag ttccagtatt 
gcactctcag atgaggatct tgctcagttg 
gaagtttact tcataaatta ctttttccta 
cattcctcca aaggtaagta ttctccaaag 
tgtgattatt gcataatgac actaaatatt 
tcaatgatca gttcaagaaa atgcagatca 
acaaactatg gcccacaggc taaacccagc 
agacataacc acactcattc atttctgtat 
agaactgaat agttgtgaca gagactgtat 
gcccattcta aaaaaagtgt gccaattcct 
aagcctattt gaaatgtgtt ttttttaggg 
99tgactcag caaccaaaca aaagggatac 
tatggacata aaaggaaact ttcagaaaga 
caacacaatg gaattc 

<210> 91 

<211> 911 

<212> DNA 

<213> Homo sapiens 



ttattagaga 
aaaaccaacc 
aatttttgtt 
atctaagtat 
tacaaaagtg 
gctatacctg 
tagaatgtac 
caggtttagg 
atctgatgtt 
gtcttctgtc 
tattagataa 
tatgttccat 
cagaaagtaa 
ggcctcactc 
ctctacaggt 
gatatgattt 
ctaatacaaa 
tcaaagtaat 
aaagcttcac 
ttctgaagca 
ggttaatggt 
ttagactatc 
tatccaggac 
gtaagtattt 
atgtggcttt 
tatataacta 
ctccccttgt 
tgtgtatagc 
ggaccgtgaa 
ggtttacact 
gctgtaatta 
taatttttta 
aaaggaggaa 



cataacacgt 
aacactttct 
caactgcttt 
tcacagatgt 
ccaccaaagc 
ggaaacaaaa 
tgtatatcca 
tgtgtggtat 
aagtataaag 
tttatatttt 
gtaacacttg 
gaactctaat 
agggctgaaa 
ttacttgtcc 
aggttaaaat 
taaaacagat 
aacacataag 
gcaaaataat 
atatccacac 
taaataaact 
ctctggggat 
attagtttga 
tctgcctgag 
gactattaac 
tctgttaggt 
aggttttaca 
ttttataaat 
tgctttcacg 
gcataaatat 
aaaatataga 
ccaattaaaa 
tgaacaatat 
aataaagggg 



ataaaataaa 
catttttaca 
gtcattcaac 
gtcttaggct 
attttaaaga 
aatctcacac 
tatggcacat 
ggagctagga 
tcatatgcct 
tatcacaaac 
atcattaagc 
agcccaactt 
aagtagaaca 
ttccattcaa 
acttttctgt 
tttgtctgtc 
aatctacaac 
tagaagtaaa 
acaaatatgg 
caaagtaaga 
cccattttaa 
ttaagcaact 
aaattttata 
acaaaggcaa 
ttataagttt 
ccagtggttg 
aagttttatt 
ctatactagc 
ttaccatctg 
gtttagtggg 
ttaaggttca 
atttgtattt 
gaaagggacc 



300 
360 
420 
480 
540 
591 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
1996 



<40D> 91 

gccctttttt tttttttttt cttgtttaaa aaaattgttt tcattttaat gatctgagtt 



60 
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agtaacaaac aaatgtacaa aattgtcttt cacatttcca tacattgtgt tatggaccaa 120 

atgaaaacgc tggactacaa atgcaggttt ctttatatcc ttaacttcaa ttattgtcac 180 

ttataaataa aggtgatttg ctaacacatg catttgtgaa cacagatgcc aaaaattata 24 0 

catgtaagtt aatgcacaac caagagtata cactgttcat ttgtgcagtt atgcgtcaaa 300 

tgcgactgac acagaagcag ttatcctggg atatttcact ctatatgaaa agcatcttgg 360 

agaaatagat tgaaatacag tttaaaacaa aaattgtatt ctacaaatac aataaaattt 420 

gcaacttgca catctgaagc aacatttgag aaagctgctt caataaccct gctgttatat 480 

tggttttata ggtatatctc caaagtcatg ggttgggata tagctgcttt aaagaaaata 540 

aatatgtata ttaaaaggaa aatcacactt taaaaatgtg aggaaagctt tgaaaacagt 6 00 

cttaatgcat gagtccatct acatattttc aagttttgga aacagaaaga agtttagaat 660 

tttcaaagta atctgaaaac tttctaagcc attttaaaat aagatttttt tccccatctt 720 

tccaatgttt cctatttgat agtgtaatac agaaatgggc agtttctagt gtcaacttaa 780 

ctgtgctaat tcataagtca ttatacattt atgacttaag agttcaaata agtggaaatt 840 

gggttataat gaaaatgaca agggggcccc ttcagcagcc actcatctga actagtaatc 900 

ccaacacaat g 911 



<210> 92 

<211> 1710 

<212> DNA 

<213> Homo sapiens 



<400> 92 

tttttttttt tttttaactt ttagcagtgt ttatttttgt taaaagaaac caattgaatt .60 

gaaggtcaag acaccttctg attgcacaga ttaaacaaga aagtattact tatttcaact 120 

ttacaaagca tcttattgat ttaaaaagat ccatactatt gataaagttc accatgaaca 180 

tatatgtaat aaggagacta aaatattcat tttacatatc tacaacatgt atttcatatt 240 

tctaatcaac cacaaatcat ataggaaaat atttaggtcc atgaaaaagt ttcaaaacat 300 

taaaaaatta aagttttgaa acaaatcaca tgtgaaagct cattaaataa taacattgac 360 

aaataaatag ttaatcagct ttacttatta gctgctgcca tgcatttctg gcattccatt 420 

ccaagcgagg gtcagcatgc agggtataat ttcatactat gcgaccgtaa agagctacag 480 

ggcttatttt tgaagtgaaa tgtcacaggg tctttcattc tctttcaaag gaagatcact 54 0 

catggctgct aaactgttcc catgaagagt accaaaaaag cacctttctg aaatgttact 600 

gtgaagattc atgacaacat atttttttta acctgttttg aaggagtttt gtttaggaga 660 

ggggatgggc cagtagatgg agggtatctg agaagccctt ttctgtttta aaatataatg 720 

attcactgat gtttatagta tcaacagtct tttaagaaca atgaggaatt aaaactacag 780 

gatacgtgga atttaaatgc aaattgcatt catggatata cctacatctt gaaaaacttg 840 

aaaaggaaaa actattccca aagaaggtcc tgatacttaa gacagcttgc tgggtttgat 900 

caaagcagaa agcatatact ttcaagtgag aaaacagcag tggcaggctt gagtcttcca 960 

agcaatcaaa tctgtaaagc agatggttac tagtaagtct agttatggga gtctgagttc 1020 

taactcatgc tgtgcttgct ggatttgctg gctcttttcc gctctctgtg atgctggact 1080 

ggcttgtcag gtgacatgct ctcaaagttg tgactggact cgttgtgctg ccgggtgtac 1140 

ctcttgcact tgcaggcagt gactactgtg attttgtagg tgcgtgtgct gccatcttgg 1200 

cactgcagct ggattctctg ggtacgggtt ttgtcattga cacaccgcca ctcctgggag 1260 

ctcctcctgc tccagtactt tgttccatag cctcctccaa tccagttagg gagcactggc 1320 

aggggcaagc actcgccagc acacaccagc tccttcagag ggctgatgct ggtgcactgg 13 8 0 

ccatcagaga tgtatttggt ggaacgcagt tcccggcaac ccacttgaac ccgagtgttc 144 0 

cgatccagtc cagtgttact gaaatgcctg cctccatttc tggcttgatt caacgtgctg 1500 

ttgctgctgg ggtgtgctgg aacaggttta accacatgtg aataaaggat ttctgtggca 156 0 

tcatttttaa aagccaaaca gcttttcatt aggatgcatg caaggggaag gagatagaaa 1620 

tgaatggcag gaggaagcat ggtgagtaga ggatttgctt gactgaagag ctggttaatt 1680 

cttttgcctc tgcccaacac aatggaattc 1710 



<210> 93 
<211> 251 
<212> DNA 
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<;213> Homo sapiens 
<400> 93 

cccaccctac ccaaatatta gacaccaaca cagaaaagct agcaatggat tcccttctac 60 

tttgttaaat aaataagtta aatatttaaa tgcctgtgtc tctgtgatgg caacagaagg 120 

accaacaggc cacatcctga taaaaggtaa gaggggggtg gatcagcaaa aagacagtgc 180 

tgtgggctga ggggacctgg ttcttgtgtg ttgcccctca agactcttcc cctacaaata 240 

actttcatat g 251 

<210> 94 

<211> 738 

<212> DNA 

<213> Homo sapiens 



<400> 94 

cccttttttt ttttttttcc acttctcagt ttatttctgg gactaaattt gggtcagagc 60 

tgcagagaag ggatgggccc tgagcttgag gatgaaagtg ccccagggag attgagacgc 12 0 

aacccccgcc ctggacagtt ttggaaattg ttcccagggt tcaactagag agacacggtc 180 

agcccaatgt gggggaagca gaccctgagt ccaggagaca tggggtcagg ggctggagag 24 0 

atgaacattc tcaacatctc tgggaaggaa tgagggtctg aaaggagtgt cagggctgtc 300 

cctgcagcag gtggggatgc cggtgtgctg agtcctggga tgactcagga gttggcctgg 360 

atggtttcct ggatccactt ggtgaacttg cagaggttcg tgtagacacc cggtctgttg 420 

ggccgggcac aagggtaatc tccccaggac acgagtccct gcagggagcc attgcagacc 480 

acaggccccc cagaatcacc ctggcaggag tctctacctg ctttgtcacc ggcgcagaac 540 

atggtgtcat ctatctgtct cgggtaagca tcctcgcacc ttttctgact tagcacgctg 600 

atattcaagc actggaggac cttagggaag tgcacttggg ggctcttggt tgtcccccag 660 

ccagacacca agcactttgt cccagcagag ggacaatgag aggagacgtt gatgggtctg 720 

acatctttag tgggacga 738 
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1. Claims: 1-11.17,18,25-34 and 36-52 all partially 

An isolated nucleic acid comprising the nucleotide sequence 
of SEQ.ID.N.3, encoding an imnunogenic portion of a breast 
protein or a variant of said protein, a vector and a host 
comprising the nucleic acid, its use for the production of 
the polypeptide or antibodies reactive with the polypeptide, 
its use for the manufacture of a vaccine, its use for the 
detection of breast proteins or nucleic acids. 
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